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Abstract 

Background: Solid tumors present a panoply of genomic alterations, from single base changes to the gain or loss 
of entire chromosomes. Although aberrations at the two extremes of this spectrum are readily defined, 
comprehensive discernment of the complex and disperse mutational spectrum of cancer genomes remains a 
significant challenge for current genome analysis platforms. In this context, high throughput, single molecule 
platforms like Optical Mapping offer a unique perspective. 

Results: Using measurements from large ensembles of individual DNA molecules, we have discovered genomic 
structural alterations in the solid tumor oligodendroglioma. Over a thousand structural variants were identified in 
each tumor sample, without any prior hypotheses, and often in genomic regions deemed intractable by other 
technologies. These findings were then validated by comprehensive comparisons to variants reported in external 
and internal databases, and by selected experimental corroborations. Alterations range in size from under 5 kb to 
hundreds of kilobases, and comprise insertions, deletions, inversions and compound events. Candidate mutations 
were scored at sub-genic resolution and unambiguously reveal structural details at aberrant loci. 
Conclusions: The Optical Mapping system provides a rich description of the complex genomes of solid tumors, 
including sequence level aberrations, structural alterations and copy number variants that power generation of 
functional hypotheses for oligodendroglioma genetics. 
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Background 

Cancer is fundamentally a disease of genomic origin. Al- 
terations in genes and regulatory elements critical to cell 
cycle control lead to uncontrolled cell growth and prolife- 
ration, the common signature of all cancers. Such events 
can cause amplification or mutational activation of onco- 
genes [1,2], deletion or mutation deactivation of tumor 
suppressor genes [3,4], orientation of genes with incorrect 
regulatory regions [5], gene fusion products [6], etc. As 
cancers evolve, they accumulate a cascade of mutations, 
ranging in size from a single nucleotide change to the gain 
or loss of entire chromosomes [7]. Coupled with the 
subclonal heterogeneity that is a hallmark of solid tumors 
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[8-10], obtaining a complete portrait of the genetic land- 
scape of human cancer remains a significant challenge. 

Synergy between revolutionary genomic tools and ad- 
vances in high-throughput computing has facilitated the 
development of a number of methods for detecting mu- 
tations. Chromosome banding and spectral karyotyping 
(SKY) [11] are low-resolution techniques used to detect 
large-scale chromosomal features. However, obtaining 
metaphase spreads for performing a karyotype is often 
difficult, especially when working with solid tumor biop- 
sies and paraffin embedded, formalin fixed tissue. Fluo- 
rescence in situ hybridization (FISH) and its variants are 
a family of molecular cytogenetic techniques developed 
to correlate specific sequences to cytogenetic observa- 
tions [12]. FISH offers higher resolution (compared to 
SKY) and has the advantage of not requiring metaphase 
spreads, but is limited by the fact that it requires a prior 
hypotheses about the locus of interest, making it unsuit- 
able for discovery based research. Hybridization based 
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microarray approaches, like SNP microarrays and array 
comparative genome hybridization (CGH), have been ex- 
tensively used to detect large scale amplifications and dele- 
tions in tumor genomes [13-15], but are unable to detect 
changes where there is no net gain or loss of DNA, such 
as inversions and balanced translocations, which have 
been shown to be an important mechanism for oncogenic 
transformation [16-18]. Moreover, microarrays do not 
offer structural information, necessitating follow-up exper- 
iments to identify the breakpoints and sequence context 
of the aberration. Microarrays are also restricted to re- 
gions of the genome amenable to unique probe design, 
which precludes repeat-rich regions and novel insertions 
that are hotbeds of variation and mutation [19-22]. Most 
commercial microarrays (except custom designed, high- 
density arrays) lose sensitivity below ~50 kb, and variants, 
particularly insertions, in this size range have remained 
largely unexplored, especially in cancer genomes [23]. 

The advent of massively parallel, short read DNA 
sequencing- the 'second generation' sequencing tech- 
nologies, and their application to cancer has also accel- 
erated the pace of mutation discovery. Initially applied 
to targeted subsets of the genome, such as specific gene 
families (e.g.: all protein kinases, or 'kinome') [24-27], or 
all the coding sequences (the exome) [28-33], second- 
generation sequencing is increasingly being used to in- 
terrogate whole cancer genomes [34-40]. In theory, 
second-generation sequencing of whole genomes has the 
ability to discern the full range of genomic alterations. 
In practice, however, more than 90% of events disco- 
vered by these platforms are less than 1 kb, and are 
biased towards deletions rather than insertions [23,41]. 
Second-generation sequencing instruments typically 
generate shorter reads with higher error rates from rela- 
tively short insert libraries, which present a significant 
computational and bioinformatic challenge in alignment 
and assembly [42]. Read-pair mapping approaches have 
successfully identified point mutations and indels in can- 
cer [36,38-40], but are limited by the insert size of the 
DNA library to detecting base substitutions and small 
indels [43] and are often confounded by repetitive re- 
gions of the genome. Further, accurate prediction of the 
exact breakpoints of an aberration depends on very tight 
size distribution of the DNA library, which can make 
library construction difficult [44]. Whole genome se- 
quencing followed by de novo assembly might mitigate 
some of these issues, but current assembly algorithms 
tend to collapse homologous sequences, and conse- 
quently dramatically under-represent repeats and seg- 
mental duplications that are known to be critical 
mediators of genomic rearrangement [42]. 

There remains a pressing need for discovery-based sys- 
tems that can provide a scalable, comprehensive view of 
the cancer genome in its entirety. In this study, we 



present Optical Mapping as one such system. Optical 
Mapping creates high-resolution ordered restriction maps 
of whole genomes through the analysis of ensembles of 
single molecule restriction maps. It has previously been 
used to map the genomes of microbes [45-48], plants 
[49-52] and mammals [53-57]. However, this is the first 
time it has been employed to analyze the genome of a 
solid tumor. Optical Mapping offers several unique advan- 
tages towards assembling the complex structure of a can- 
cer genome. Genomic DNA isolated directly from cells is 
analyzed, thereby obviating any bias introduced by ampli- 
fication or cloning steps. Moreover, because the DNA is of 
high-molecular weight (300 kb - >500 kb), segmental du- 
plications and other repeat-rich regions of the genome are 
revealed, and additionally, the structure and long-range 
context of any aberration are determined. Since the re- 
striction maps are made from single DNA molecules, Op- 
tical Mapping effectively pieces together heterogeneous 
alterations, which is especially important for tumor gen- 
ome analysis, as we demonstrate in oligodendroglioma. 

Oligodendrogliomas are frontal lobe tumors that are 
thought to arise from oligodendrocytes, supporting brain 
cells which provide myelination for neurons [58,59]. The 
concerted loss of heterozygosity (LOH) of chromosome 
arms lp and 19q, observed in 50-70% of patients, is a 
molecular signature of this malignancy [60]. The re- 
markably high prevalence of this molecular marker sug- 
gests that these regions harbor one or more tumor 
suppressor genes that might play an important role in 
the development of the tumor. Allelic losses of lp/19q 
have been correlated with positive response to chemo- 
and radiotherapy and prolonged survival for patients 
with oligodendroglioma [61]. However, it remains un- 
clear whether LOH of lp/19q is a prognostic biomarker 
for a more indolent tumor subtype that has fewer un- 
favorable mutations overall, rather than predictive of 
treatment sensitivity [62,63]. In fact, studies have shown 
that lp/19q codeleted tumors have slower growth rates 
and are more responsive to treatment than tumors with- 
out the codeletion [64,65]. In order to explore each of 
these possibilities, Optical Mapping was used to create 
physical maps from two individual oligodendroglioma 
tumor biopsies for the purpose of identifying and cha- 
racterizing structural changes on a whole genome basis. 

Results and discussion 

Optical map construction 

We used the Optical Mapping (OM) system to explore 
the genomic landscape of a solid tumor. Optical Mapping 
creates high-resolution physical maps of genomes through 
the analysis of ensembles of single molecule ordered re- 
striction maps. The tumor biopsies were disaggregated 
into single cells, then run through a Percoll gradient to en- 
rich for cancer cells (methods). High molecular weight 
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genomic DNA was extracted directly from these cells, 
stretched and immobilized in regular arrays on posi- 
tively charged glass surfaces using a microfluidic device 
(Figure 1A, details in methods) [66]. After deposition, 
the DNA was digested with the restriction enzyme Swal. 
The surface-bound restriction fragments remained in 



register, and were stained with a fluorescent dye and im- 
aged by automated fluorescent microscopy (Figure IB). 
Dedicated machine vision software calculated the size, 
in kilobase pairs (kb), of each fragment based on mea- 
surements of integrated fluorescent intensity, resulting 
in the high throughput, massively parallel generation of 
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Figure 1 (See legend on next page.) 
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(See figure on previous page.) 

Figure 1 An overview of the Optical Mapping system. A: Single cells, obtained from a slice of the tumor biopsy, are purified by a percoll 
density gradient, then mixed with agarose and allowed to solidify in a mold, forming rectangular inserts. Prior to mapping, cells are lysed within 
the insert, the DNA electrophoretically extracted, elongated and immobilized on an Optical Mapping surface by means of capillary flow through a 
microchannel device. The lower half of panel A is a representative image of properly elongated DNA (long, white horizontal lines) after surface 
digestion, stained with YOYO-1. Microchannels are 100 urn wide as indicated by the scale bar (grey bar). B: Enlarged image of surface-bound 
genomic DNA, digested with Swal, showing discrete restriction fragments separated by gaps. C: Automated machine vision detects DNA 
molecules (pseudocolored green), and calculates the mass of each fragment (white numbers), creating ordered restriction maps (Rmaps) from 
single DNA molecules (yellow bars). D and E: Strategy for constructing a genome-wide optical map starting from single molecule Rmaps. Rmaps 
are first clustered on a restriction map generated in-silico from the reference sequence of the human genome by pairwise alignment. Then 
consensus optical map contigs are constructed by de novo assembly of the Rmaps from a given window. Finally, the consensus map contigs are 
aligned back to the reference map, and differences are identified. 



ordered restriction maps, or 'Rmaps', from individual 
genomic DNA molecules (Figure 1C) [66]. The oligo- 
dendroglioma datasets comprise close to 700,000 such 
Rmaps, with an average size of greater than 400 kb 
(Additional file 1). The relative order and distance be- 
tween successive restriction fragments in a single mol- 
ecule optical map can be used to determine the precise 
location in the genome that gave rise to that molecule, 
by means of pair-wise alignment against an in-silico re- 
striction map [67]. The HF087 tumor dataset comprised 
235,026 aligned Rmaps, corresponding to 36.92 fold 
coverage of the human genome, while the HF1551 
dataset comprised 167,012 aligned maps, representing 
25.36 fold coverage of the human genome (Additional 
file 1). In the absence of a karyotype, our assessment of 
ploidy is based on optical map coverage and Affymetrix 
array analysis. Both these platforms, discussed in detail in 
subsequent sections, calculate chromosome copy number 
relative to normal, diploid genomes, and are in agreement 
that neither tumor sample is polyploid. They do, however, 
display aneuploidy, due to allelic losses of specific chromo- 
somes/chromosome arms (lp, 19q, 13 in HF087 and lp, 
19q, 14, 21 in HF1551), so if anything the coverage is likely 
to be higher than what we reported. 

The Rmaps that cluster together upon pair-wise align- 
ment were then assembled into consensus optical maps 
and analyzed for presence of structural variants using the 
bioinformatics pipeline described in [56]. The final con- 
sensus map contigs span 96.73% and 93.92% of the human 
genome for tumors HF087 and HF1551, respectively. 

Optical map coverage analysis 
Discernment of copy number variants 

Copy number was inferred from aligned coverage of 
Rmaps, prior to assembly, in a manner analogous to read- 
depth based methods for detecting copy number variants 
from second generation sequencing data (methods). 
Briefly, Rmaps were aligned to the in silico reference map, 
and then partitioned into discrete windows spanning each 
chromosome. These alignments were then compared to 
alignments of a reference data set (comprising of a num- 
ber of normal genomes) that was used to 'normalize' the 



observed coverage. This was necessary because the num- 
ber of Rmaps that align to a particular region of the ge- 
nome depends, in part, on the density of restriction sites 
in that region, which varies from chromosome to chromo- 
some (ranging from a low of 2.5 cuts/ 100 kb on chromo- 
some 22 to a high of 9.25 cuts/100 kb on chromosome 4). 
A Hidden Markov Model (HMM) was then fitted to this 
data, and copy number changes were detected (Figure 2A) 
[68,69]. Optical map coverage analysis confirmed the al- 
lelic loss of chromosome arms lp and 19q in HF087 and 
HF1551. The breakpoints appear to be very close to the 
centromere, consistent with the proposed mechanism of 
an unbalanced reciprocal translocation mediating the 
LOH event [70,71]. Additionally, coverage analysis also 
detected allelic loss of chromosome 13 (HF087), 14 and 21 
(HF1551), which are known to be rarer events associated 
with oligodendroglioma. 

Solid tumor heterogeneity 

The genome wide optical map of HF1551 was created using 
DNA from two adjacent slices of the tumor: 446,933 
(-55%) Rmaps originated from slice 1 and 202,974 (-45%) 
Rmaps from slice 2 (Figure 2). Interestingly, when the 
Rmaps were partitioned according to the slice they origi- 
nated from, and coverage analysis was performed sepa- 
rately, unique copy number profiles were obtained for each 
slice. In addition to allelic losses of lp and 19q, slice 1 also 
had LOH of chromosomes 14 and 21, while slice 2 had evi- 
dence of LOH of 19p (Figure 2B). Solid tumors are dynamic 
aggregates of continually evolving subclones, resulting in 
spatial and temporal genetic heterogeneity. Our findings 
suggest that the tumor slices used for Optical Mapping 
evolved from distinct cancer cell clones, and is congruent 
with recent evidence of branched evolutionary tumor 
growth [72-74] . Although assembly of whole genome maps 
on a per slice basis was not feasible due to insufficient 
depth of coverage, our results establishes proof-of-principle 
of Optical Mapping to interrogate tumor heterogeneity. 

Discovery of optical structural alterations 

The optical consensus maps generated by map assembler 
were aligned to the in silico restriction map (generated 
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Figure 2 Intra-tumor heterogeneity. A: Copy number profiles, inferred from analysis of optical map coverage of tumor HF087 and HF1551. For 
each panel, the x-axis is co-ordinates of the human genome (chromosome numbers are indicated at the top), and y-axis is counts of Rmaps that 
align to a particular genomic interval. The grey curve plots the observed number of counts in an interval, and the red line indicates the sequence 
of underlying copy number states (also called the Viterbi path). B: Copy number analysis of tumor HF1551 by slice. Slice 1 (green) has LOH of 
chromosomes 1p, 1 9q, 14 and 21, while slice 2 (blue) has losses of chromosomes 1p and 19 only. 



from build 35 human reference sequence), and by com- 
parison of the order and sizing of the 219,224 restriction 
fragments (fragments smaller than 0.4 kb in size were 
merged) between the experimental and the reference map. 
Such comparisons revealed structural variants in the ex- 
perimental genome that were classified as four types: extra 
cuts (EC), where the optical consensus map displays a re- 
striction site that was not predicted by the reference se- 
quence; missing cuts (MC), where a cut that was predicted 
was not observed in the experimental map; insertions 
(INS), where the size of a fragment in the consensus map 
was significantiy larger than its counterpart in the refer- 
ence map; deletions (DEL), where a fragment in the ex- 
perimental map was smaller than the corresponding 
reference fragment (or missing altogether); and finally, 
complex events (OTHER) involving multiple cut or size 
differences (methods). Approximately a third of the ECs 
and MCs represent small indels that are below the reso- 
lution of Optical Mapping (~3 kb) [56]. Figure 3C shows 
an example of each class of variant detected by Optical 
Mapping. 

At first glance, it might appear that any one of these 
variants could be attributed to errors inherent in Optical 
Mapping. For instance, a missing cut could be due to 



incomplete digestion, an extra cut could result from spuri- 
ous cutting by the restriction enzyme, or physical breakage 
of the DNA molecule, and uneven staining could lead to 
inaccurate estimation of fragment size. However, the high 
throughput advantage of Optical Mapping allows us to 
distinguish such random errors from legitimate genomic 
events. Any alteration in the optical consensus map was 
supported by multiple single molecule maps (Rmaps), 
each representing an independent observation at that 
locus. The Optical Mapping error models estimated the 
statistical significance of each structural variant, after tak- 
ing into account the quality and quantity of the data [56] . 

A total of 1081 and 1085 differences were detected in 
HF087 and HF1551 respectively (Figure 3A and B). The 
distribution of structural variants across the genome is uni- 
form and the pattern is similar for both tumors (Figure 3A). 
Variants range in size from single base differences to com- 
plex genomic events spanning hundreds of kilobases 
(Figure 4). Approximately 800 single base changes were 
detected in each tumor, including point mutations (such as 
the example depicted in Figure 4A), polymorphic SNPs 
where only one allele has a Swal restriction site (referred to 
as snip-SNPs [75]), and small indels that create or remove a 
Swal cut site but are below the detection limit of Optical 
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Figure 3 Genome-wide distribution of optical structural alterations (OSAs) detected in oligodendroglioma. A: Horizontal yellow bars, 
numbered on the left, represent human chromosomes (heterochromatic regions are in grey). Tick marks depict locations of structural variants 
from HF087 (red) and HF1551 (blue). B: The total number of events detected in each tumor sample, also broken down by category. 
C: An example of each class of variant is shown in the inset figure. 



Mapping. 179 indels with a median size of 6.6 kb were 
detected in each sample (Figure 5). For comparison, the 
median size of indels reported in the Database of Genomic 
Variants is 2.3 kb (Figure 5, inset). -70 complex events 
were found in each tumor, including known polymorphic 
loci such as the major histocompatibility complex (MHC), 
giving us confidence that these results are not spurious. 
Optical Mapping also discerns balanced genomic events, 
where there is no net gain or loss of genomic sequence. A 
putative inversion spanning 352 kb of chromosome 7 was 
observed in HF087 which appears to disrupt the ZNF92 
gene (Figure 4C). Finally, the largest events detected by Op- 
tical Mapping include gains or losses of entire arms of 
chromosomes, for example, the allelic loss of chromosome 
1 illustrated in Figure 4D, and discussed in detail previously. 
Intersection counts with genes, segmental duplications, 
published SNPs (dbSNP build 135, http://www.ncbi.nlm. 



nih.gov/projects/SNP/) and published structural variants 
(Database of Genomic Variants, November 2010 release) 
are shown in Additional file 2. Comprehensive breakdown 
of the overlaps are shown in Additional file 3. 

Optical Mapping provides a comprehensive description 
of the vast and complex landscape of cancer genomes. 
The ability to study the genome in its entirety, including 
non-genic or repetitive regions using a single technology 
minimizes ascertainment bias. As detailed in subsequent 
sections, it is employed to generate a list of candidate can- 
cer genes that is not hypothesis-limited, and elucidate 
their structure at sub-genic resolution. 

Validation of copy number and structural variants 
Experimental validation: SNP array 

The Affymetrix Genome Wide Human SNP 6.0 Array, 
which has probes for detection of both SNPs and copy 
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Figure 4 Spectrum of genomic alterations in oligodendroglioma. A: ~800 single base alterations were found in each tumor, like the G > A 
transition in the STMN2 gene shown. B: A ~8 kb insertion from tumor HF1551. 179 such indels were detected per sample. C: A 352 kb inversion 
that disrupts pseudogene INTS4L1 and encompasses the zinc-finger transcription factor ZNF92. D: Loss of one copy of chromosome 1 p, 
a hallmark of oligodendroglioma. 



number variants (CNVs), was used to validate our 
findings. 

Both platforms concurred on the LOH of chromosomes 
lp, 19q and 13, but allelic loss of chromosome 14 in 
HF1551 was not detected by the Affymetrix array. The 
copy number profile generated by running the HMM al- 
gorithm on the maps from the first slice of tumor HF1551 
was similar to that from the array, which suggests that the 
DNA originated from tumor sections that were in closer 
proximity. 

Many of the SNP probes on the Affymetrix chip corres- 
pond to Swal snip-SNPs. Hence, the array data was used 
to validate ECs and MCs. We observed 100% (62/62 in 
HF087, 44/44 in HF1551) concordance between the SNP 
genotype and the Swal cut pattern at all overlapping cut 
differences in both tumors (Additional file 4). 

The copy number variants detected by the array were 
also compared to Optical Mapping indels. Signal inten- 
sities from the chip were normalized by global median 
scaling, and copy number was assessed using several dif- 
ferent algorithms (methods), relative to a reference model 
file generated from the 270 HapMap samples. Though the 
resolution of array CGH is much lower than Optical Map- 
ping, we were able to validate 24 structural variants in 
tumor HF087 and 16 in tumor HF1551 (Additional file 5). 



Experimental validation: PCR 

The nature of many of the structural variants, being within 
repetitive portions of the genome, but detected by Optical 
Mapping unfortunately precludes their comprehensive 
validation by simple PCR techniques. Accordingly, we se- 
lected two variants that were amenable to PCR and over- 
lapped genes that may offer insights into the chemo- and 
radio-sensitivity of oligodendroglioma. These loci were 
then PCR amplified, cloned and sequenced (methods). 

The optical map shows an EC in the PARK2 gene in 
HF1551 (Figure 6A). PARK2 is a putative tumor sup- 
pressor, and mutations in this gene have been reported 
in multiple cancer types (detailed in 'candidate muta- 
tions' section of this document). An 848 bp amplicon 
spanning the predicted location of the EC was obtained 
(Figure 6B), and Sanger sequencing proved that a G to T 
transversion resulted in the creation of a new Swal restri- 
ction site (Figure 6C). 

We also validated an EC in tumor HF087 that occurred 
in the STMN2 gene (Figure 7A). As discussed in subse- 
quent sections, STMN2 regulates microtubule dynamics 
and is believed to be a target of beta-catenin/TCF signal- 
ling. We amplified a 1003 bp region around the putative 
mutation (Figure 7B), and were able to validate the alter- 
ation via sequencing (Figure 7C). 
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Figure 5 Size distribution of indels found by Optical Mapping. Histogram of indel sizes detected by Optical Mapping. The x-axis is size of 
indel in kilobase pairs, and the y-axis is number of events. For comparison, a similar graph is shown (inset) of the distribution of indel sizes from 
the Database of Genomic Variants. 



Comparative validation 

We also validated our findings by comparing them to 
two sources- Optical Mapping data from several nor- 
mal genomes, and publicly available SNP and structural 
variant data. First, oligodendroglioma structural vari- 
ants were compared against structural variants found 
by Optical Mapping of 6 other normal human genomes 
by our laboratory. This internal database includes: 
(three lymphoblast-derived cell lines and a complete 
hydatiform mole (dbVar study ID nstd49) [56], a 
lymphocyte-derived cell line (unpublished) and an early 
passage human embryonic stem cell line [55]). 80%- 
90% of oligodendroglioma variants were also detected 
in at least one of the normal human genomes 
(Additional file 6), suggesting that such loci are poly- 
morphic, and affirming the veracity of our findings. 
Then, oligodendroglioma structural variants were com- 
pared against variants in the Database of Genomic 
Variants (DGV). The DGV is an extensive catalogue of 
structural variation in normal humans, currently hold- 
ing 101,923 events detected by a variety of platforms. 
We observed the greatest concordance with variants 
found by fosmid-end sequencing (~15%) and high 
density oligonucleotide array CGH (-10%) (Additional 
file 6). Finally, cut differences detected by Optical 
Mapping were compared to published SNPs. Detailed 
breakdown of these intersections are shown in 



Additional file 3; parameters for comparisons are de- 
scribed in the Methods section. 



Candidate mutations 

Separation of mutational and polymorphic OSAs 

The ultimate goal of our mapping efforts was to identify 
genes or genomic elements that maybe important to the 
biology of oligodendroglioma, with the caveat that such 
'candidates' represent hypotheses requiring rigorous 
testing to establish their functional role in tumorige- 
nesis. Distinguishing between structural polymorphisms 
and somatically acquired mutations is a key step to- 
wards accomplishing this goal. Unfortunately, matched 
normal DNA from the individuals whose tumors were 
optically mapped was not available. Instead, we adopted 
a stringent filtering scheme to remove putative poly- 
morphisms and enrich for somatic mutations, based on 
comparisons to internal and publicly available data (de- 
scribed above). Parameters for these comparisons were 
determined based on the Optical Mapping error model 
and designed to be extremely parsimonious (methods). 
As a result of these operations, we arrived at a total of 
21 somatic mutations (5 genes) in HF087 and 73 som- 
atic mutations (21 genes) in HF1551. Since two muta- 
tions are seen in both tumors, 24 unique candidate 
cancer genes were identified in oligodendroglioma 
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Forward primer: 5' TCAAGCAAACACTGATATACCCAG 3' 
Reverse primer: 5' TCACTAAACTCACTTGCCCCTAAC 3' 



Inormall 5' . . TAATATTAATTTAAAGCACCAG AGCAG AATG . . .3' 



Itumorl 5' . . .TAATATTAATTTAAATC AC C AG AG C AG AATG ... 3 

Swal cleavage site 

Figure 6 Experimental validation of PARK2 mutation by PCR- 
sequencing. A: EC in PARK2 gene on chromosome 6 of HF1551. 
The enlarged figure shows the position of the PCR amplicon. B: 
Restriction digest of the PCR amplicon. The undigested amplicon is 
848 bp. Digestion with Swal restriction enzyme is expected to yield 
two fragments of 577 bp and 271 bp (based on the location of the 
EC in the optical map). An addition digestion was performed with 
Nhel enzyme to ensure the correct amplicon was being analyzed. 
The expected sizes of the Nhel fragments are 700 bp and 148 bp. C: 
Sequence of the PCR amplicon showing the G >Ttransversion that 
creates a new Swal cut site. 



(Table 1). A few interesting candidate genes will be 
discussed in the next section. 

Candidates common to both HF087 and HF1551 

Two candidate genes, NPAS3 and OSBPL3, harbored mu- 
tations in both tumor samples (Figure 8). NPAS3 (neuronal 
PAS domain protein 3) shows a complex event accom- 
panied by a ~7 kb gain in the HF087 optical map, and a 
missing cut in the HF1551 optical map (Figure 8A). 



This neuronally expressed basic helix-loop-helix tran- 
scription factor has been implicated in schizophrenia 
[76,77] and bipolar disorder [77], and is frequently de- 
leted or inactivated in many cancers. Recently, it has 
been demonstrated that NPAS3 exhibits features of a 
tumor suppressor which drives late progression of ma- 
lignant astrocytomas, and is a negative prognostic 
marker for survival [78] . 

Both tumor optical maps display cut differences in the 
OSBPL3 (oxysterol binding protein like-3) gene (Figure 8B). 
This gene plays a vital role in cell adhesion, cytoskeletal 
organization and lipid metabolism [79-81]. It is highly 
expressed in B-cell associated malignancies [82,83], where 
it is one of the common sites of retroviral integration [84]. 
An independent study that used exon sequencing to study 
oligodendroglioma also found somatic mutations in 
OSBPL3 [30]. 

Candidates observed in either HF087 or HF1551 

In the HF1551 optical map, we observe a point mutation 
that creates a Swal restriction site in the PARK2 gene 
(Figure 6). This gene encodes an E3 ubiquitin ligase, called 
Parkin that catalyzes the ubiquitination of a variety of 
target proteins for proteasome mediated degradation. 
Germline mutations in PARK2 have long been known to 
cause autosomal recessive juvenile Parkinson's disease 
[85-87]. More recently, PARK2 has been identified as a 
tumor suppressor gene in Glioblastoma multiforme, 
breast, ovary, lung, colorectal and liver cancers [28,88-94]. 
It encompasses most of FRA6E, the third most active 
common fragile site in the human genome [95], and 
shares the characteristics of other tumor suppressors such 
as FHIT and WWOX, that also occur in fragile sites. 
PARK2 is frequently deleted or inactivated in cancer cell 
lines and primary tumors [88,92], and concomitantiy, Par- 
kin expression is either significantly diminished or absent 
[89,92]. Unlike classical tumor suppressors where biallelic 
inactivation is necessary for oncogenesis, heterozygous 
mutations in PARK2 are sufficient to confer a growth ad- 
vantage during tumor development [88,92]. Restoring 
Parkin expression in Par/dM-deficient cell lines reduces 
their profileration in vitro [92], while injection of Parkin- 
deficient cells into immunocompromised mice generate 
tumors in vivo [91]. Interestingly, PARK2 also mediates 
chemosensitivity in breast cancer via microtubule 
dependent mechanism [93,96-98]. 

STMN2 (stathmin-like 2) is another interesting candi- 
date gene. We observe a point mutation in this gene in 
tumor HF087 (Figure 7). STMN2 is a neuron specific 
member of the stathmin family of small regulatory phos- 
phoproteins which control cell profileration and differ- 
entiation [99]. It is up-regulated in liver cancer and has 
been identified as a target of p-catenin/TCF-mediated 
transcription [100]. STMN2 sequesters soluble tubulin, 
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Forward primer: 5' TGCTTCCTAAAAGAGTCCGTAG 3' 
Reverse primer: 5' GTCAGCCCTCAATTTCAGATG 3' 



5' . . . GTATTTAAAATTTAA TATAATCTTGGATGC...3 



Itumorl 5'...GTATTTAAAATTTAAATATAATCTTGGATGC...3' 

Swal cleavage site 

Figure 7 Experimental validation of an EC in the STMN2 gene 
by PCR-sequencing. A: EC in STMN2 gene on chromosome 8 of 
HF087. The enlarged figure shows the position of the PCR amplicon. B: 
Restriction digest of the PCR amplicon. The undigested amplicon is 
1003 bp. Digestion with Swal restriction enzyme is expected to yield 
two fragments of 519 bp and 484 bp (based on the location of the EC 
in the optical map). An addition digestion was performed with Nhel 
enzyme to ensure the correct amplicon was being analyzed. The 
expected sizes of the Nhel fragments are 616 bp and 387 bp. C: 
Sequence of the PCR amplicon showing the G > A transition that 
creates a new Swal cut site (blue triangle). 



forming a ternary complex, inhibits microtubule assem- 
bly and induces their disassembly [101]. Its highly simi- 
lar, but more well-studied paralog STMN1, located on 
chromosome lp, is known to sensitize cells to anti- 
microtubule drugs in glioma [102,103], breast [104,105] 
and prostate cancer [106].In light of recent studies dem- 
onstrating the synergistic epistasis between paralogous 
genes involved in essential cellular functions and its thera- 
peutic implications [107,108], we speculate that STMN1 



and STMN2 might be functionally redundant, and inacti- 
vation of STMN2 might, in part, explain the treatment 
sensitivity of oligodendroglioma. 

In the HF1551 optical map, we see an extra cut in the 
gene ZFYYE26 (zinc finger, FYVE domain containing 26). 
Spastizin, the zinc finger protein encoded by ZFYVE26, 
causes the neurological disorder hereditary spastic para- 
plegia [109]. This gene binds to the tumor suppressor 
Beclin-1 and regulates cytokinesis [110,111], and is recur- 
rently mutated in breast cancer [29] . 

We detected a 485 kb inversion on 7qll.23 in tumor 
HF1551. Hemizygous deletions spanning a 1.5-1.8 MB re- 
gion of this locus cause the neurodevelopmental disorder, 
Williams-Beuren (WB) syndrome [112]. However, inversion 
of this region is polymorphic, and is present in -6% of the 
general population, and in ~25% of transmitting parents in 
WB families [113-115]. Given the disparity in size between 
the aberration detected by OM and reported instances of 
the WB inversion, it is possible that the event we observe 
arose de novo and is distinct from the 'canonical' inversion. 
To test this hypothesis, we ran several targeted assemblies 
on the WB region. The general strategy for this approach 
was to modify the reference map in-silico to reflect our hy- 
pothesized structure (Figure 9A), and then use the iterative 
assembly framework described earlier to pull out individual 
restriction maps and generate an optical consensus map 
(methods). Since our map assembly pipeline was designed 
to provide the single most conservative answer, this ap- 
proach is helpful in detecting large-scale aberrations that 
are significantly different from the reference sequence. Of 
the eight modified reference maps we started with, the one 
that reflected the canonical WB inversion/deletion did 
not grow, while the one that reflected the 485 kb event 
successfully generated a consensus map that spanned it 
and flanked contiguous regions on chromosome 7 
(Figure 9B). The optical consensus map also closes the 
putative sequence gap immediately to the right of the 
inversion, and in fact, approximately half of the 
sequence gaps in the reference genome (NCBI, build 
35) are spanned by optical consensus maps. The 
inversion encompasses the genes GTF2IRD2, PMS2P5, 
WBSCR16, GTF2IRD2B and NCF1, and its breakpoints 
appear to disrupt the genes GTF2I and STAG3L2. In 
the absence of matched normal DNA, it is impossible 
to ascertain if the inversion we detected was inherited 
through the germline or somatically acquired, however, 
this is the first report, to the best of our knowledge, of 
inversions in the WB region in the context of cancer. 

Candidates on lp or 19q 

The concerted loss of chromosome arms lp and 19q is a 
hallmark of oligodendroglioma. Seen in 50%-70% of tu- 
mors, it is believed that these regions harbor one or more 
tumor suppressor genes that play an important role in the 



Ray et al. BMC Genomics 2013, 14:505 
http://www.biomedcentral.com/1471 -21 64/1 4/505 



Page 11 of 21 



Table 1 Candidate cancer genes identified in oligodendroglioma 



Gene symbol 


Gene name 


Entrez gene ID 


Location 


Tumor sample 


Al \ /IC 1 
ALIVI b I 


Alstrom syndrome 1 


7Q/n 
/o4U 


CnrZ,/J5t> IZOOJ,/DOD/u40 


Mr 1 jj 1 


APPL1 


adaptor protein, phosphotyrosine interaction, 
PH domain and leucine zipper containing 1 


26060 


chr3,57261 764,57307498 


HF1551 


ARHGAP10 


Rho GTPase activating protein 10 


79658 


chr4,1 48653452,1 48993927 


HF1551 


CCDC91 


coiled-coil domain containing 91 


55297 


chrl 2,2841 01 32,28703099 


HF1551 


CECR2 


cat eye syndrome chromosome region, candidate 2 


27443 


chr22,1 7956627,1 8033845 


HF1551 


DIAPH2 


diaphanous homolog 2 (Drosophila) 


1730 


chrX,95939661, 96724837 


HF087 


EFHC2 


EF-hand domain (C-terminal) containing 2 


80258 


chrX,440071 27,44202923 


HF1551 


EIF1 


eukaryotic translation initiation factor 1 


10209 


chrl 7,398451 26,39847898 


HF1551 


LASS3 


LAG1 homolog, ceramide synthase 3 


204219 


chr15,100940599,101084925 


HF1551 


LOC339166 


uncharacterized RNA coding gene 


339166 


chrl 7,5675553,5834016 


HF1551 


LRRN2 


leucine rich repeat neuronal 2 


10446 


chrl, 204586302,204654481 


HF1551 


MYOF 


myoferlin 


26509 


chrl 0,950661 85,95242074 


HF1551 


NPAS3 


neuronal PAS domain protein 3 


64067 


chrl 4,33408458,34273382 


HF087, HF1551 


OSBPL3 


oxysterol binding protein-like 3 


26031 


chr7,248361 63,2501 9760 


HF087, HF1551 


PARK2 


Parkinson disease (autosomal recessive, juvenile) 2, parkin 


5071 


chr6,161768589,163148834 


HF1551 


PAX7 


paired box 7 


5081 


chrl ,1 8957499,1 9075360 


HF1551 


PHLDB2 


pleckstrin homology-like domain, family B, member 2 


90102 


chr3,1 11451326,1 11695364 


HF1551 


PLEKHM3 


pleckstrin homology domain containing, family M, member 3 


389072 


chr2,20868601 1 ,208890284 


HF1551 


PRKG1 


protein kinase, cGMP-dependent, type I 


5592 


chrl 0,5275091 0,540581 10 


HF1551 


SIPA1L3 


signal-induced proliferation-associated 1 like 3 


23094 


chrl 9,38397867,38699008 


HF087 


STMN2 


stathm in-like 2 


11075 


chr8,80523048,80578410 


HF087 


TACC2 


transforming, acidic coiled-coil containing protein 2 


10579 


chr10,123748688,124014057 


HF1551 


TCEB3 


transcription elongation factor B (Sill), polypeptide 3 


6924 


chrl, 24069855,24088549 


HF1551 


ZFYVE26 


zinc finger, FYVE domain containing 26 


23503 


chrl 4,6821 3236,68283306 


HF1551 



development of this cancer. Hence, somatic mutations on 
these chromosome arms are particularly interesting. We 
found putative mutations on 2 genes residing on chromo- 
some lp (TCEB3, PAX7) and 1 gene on 19q (SIPA1L3). 
The roles of these genes in normal and disease states, and 
the structural variants we found in them are discussed 
briefly in the subsequent section. 

We observe a 6.3 kb deletion that potentially ablates the 
first exon of TCEB3 in tumor HF1551. TCEB3 (transcrip- 
tion elongation factor B, polypeptide 3) encodes the tran- 
scriptionally active subunit of the mammalian elongin 
complex [116,117]. This elongation factor stimulates the 
rate of transcription by suppressing the transient pausing 
of RNA polymerase II on the DNA template [118]. TCEB3 
is part of a multi-protein complex that functions as an 
elongin-based ubiquitin ligase [119], similar to the Von 
Hippel-Lindau (VHL) tumor suppressor complex, by me- 
diating DNA damage induced ubiquitination and degra- 
dation of polymerase II [120]. 

Tumor HF1551 also has an insertion in the lp-encoded 
gene PAX7 (paired box 7). The PAX genes encode a family 
of transcription factors that control development within 



the neural, myogenic and lymphoid lineages [121]. PAX7, 
in particular, is essential for survival, proliferation and mi- 
gration of myogenic progenitor cells [122], and cell fate 
decisions in the developing nervous system [123]. PAX7 is 
the target of a recurrent gene fusion with the forkhead 
protein FKHR/FOXOl that is found in -15% of patients 
with alveolar rhabdomyosarcoma [121,124]. The fusion 
transcript is much more abundant and transcriptionally 
active than wild type PAX7 [125], suggesting that the 
deregulation of PAX7 downstream target genes contribute 
to tumorigenesis. 

In the HF087 optical map, we observe a missing cut in 
the gene SIPA1L3 (signal induced proliferation associated 1 
like 3) which is located on the long arm of chromosome 19. 
This gene encodes a Ras specific GTPase activating protein 
that is found at epithelial junctional complexes. These com- 
plexes play a crucial role in mechanical adhesion between 
epithelial cells to form cellular sheets and in the 
organization of actin cytoskeleton [126]. Somatic mutations 
in SIPA113 have been discovered in cancers of the brain 
[127,128], prostate [129], breast [24], ovary [32], pancreas 
[130], colon [24], skin [131] and hematopoietic system [34], 
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Figure 8 Candidate genes harboring OSAs in both HF087 and HF1551. A: NPAS3 which bears a complex alteration in HF087 and an EC in 
HF1551. B: 0SBPL3 which harbors cut differences in both tumor samples. 





but a cohesive picture of the functional role that this gene 
plays in these diverse cancer types is yet to emerge. 

Taken together, the candidate genes discovered by Op- 
tical Mapping point to critical roles of transcriptional 
control and cytoskeletal organisation in the etiology of 
oligodendroglioma. 

Non-genic candidates 

Protein coding sequences comprise less than 2% of the hu- 
man genome. The vast non-coding portion of the genome, 
once believed to be 'junk DNA is rife with functional ele- 
ments that orchestrate the gene expression program of 
cells. Recent evidence from the ENCODE (Encyclopedia of 
DNA Elements) consortium indicates that as much as 
80.4% of the human genome encodes a defined product 
(for instance, a non-coding RNA) or displays a reprodu- 
cible biochemical signature (for instance, a specific chro- 
matin structure) [132]. Such signatures, either alone or in 
combinations, mark genomic sequences with important 
functions, such as promoters, enhancers, insulators and si- 
lencers [133]. The ENCODE data sheds some light on pos- 
sible functional roles of Optical Mapping candidates that 
are not located within genes. A number of these candidates 
are actively transcribed, for instance an EC on chromo- 
some 5 of HF1551 overlaps the transcribed pseudogene 
GUSBP9. Several non-genic variants occur within long 
intergenic non-coding RNAs (lincRNA) coding regions 
(Additional file 7). Both these classes of genomic elements 
provide an additional tier of gene regulation, and contrib- 
ute significandy to the transcriptional landscape of human 
cancers [134,135]. Several candidates also show interesting 
changes in their putative functions in cancer tissues. For 
example, we observe a MC on chromosome 2 (HF087) in a 



genomic region bearing a histone modification pattern 
characteristic of insulators in multiple different normal cell 
types, but the pattern changes to that of an enhancer in he- 
patocellular carcinoma. 

Optical Mapping provides a global view of the cancer 
genome, free from biases introduced by cloning, amplifica- 
tion or hybridization, and discovers structural variation and 
mutation on a scale ranging from kilobases to megabases. 
Moreover, since the platform uses high-molecular weight 
DNA as analyte, the long-range context and connectivity of 
each variant is preserved, potentiating meaningful interpre- 
tation of candidate genes. However, Optical Mapping does 
not provide single-base resolution. Point mutations or 
indels spanning a few base pairs, such as the events fre- 
quently observed in CIC and FUBP1 genes in lp/19q 
codeleted oligodendrogliomas, are below the lower limit of 
resolution and would remain undetected (unless they create 
or destroy a Swal restriction site). 

Biological significance of candidates identified by 
optical mapping 

The aim of this study is to generate new hypotheses for 
oligodendroglioma genetics, and as such, functional stu- 
dies are beyond the scope of this paper. However, by sur- 
veying publicly available data on the candidates discerned 
by Optical Mapping, we can gain some insight into the 
roles they might play in malignant transformation. 

Moving beyond the two tumors HF087 and HF1551, we 
wanted to take the candidate cancer genes and analyze 
them in the context of other genome-wide studies. Most 
somatic mutations in cancer cells arise due to genomic in- 
stability and do not contribute to tumorigenesis. However, 
mutations in genes that promote tumor development, so- 
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Figure 9 Strategy for assembling the -500 kb inversion in the Williams-Beuren region in HF1551. A: Construction of the modified, 
'hypothesis' reference map for directed assembly. The map has a -500 kb inversion in the center, flanked on either side by 500 kb of sequence 
that agrees with the reference map. B: After 8 iterations of map assembly, an optical consensus map is obtained that spans the hypothesized 
reference, and has multiple Rmaps that bridges across both left and right breakpoints. 



called 'driver' mutations tend to be recurrent. To assess 
the extent of recurrence of our candidates across a large 
number of samples, we used the Catalog of Somatic Muta- 
tions in Cancer (COSMIC). The COSMIC database is a 
comprehensive archive of somatic mutations in human 
cancer, combining manually curated data from scientific 
literature and the output from the Cancer Genome Project 
[136,137]. 10 genes from our list of candidates had muta- 
tion frequencies (number of unique mutated samples di- 
vided by the total number of unique samples) greater than 
10%, with the top hits being MYOF (19.1%), CECR2 
(18.8%) and ZFYVE26 (18.7%). Mutation frequencies for 
all the candidate genes are listed in Table 2. 

Since cells can employ a number of mechanisms to 
compensate for loss or mutational inactivation of genes, 
a more direct way of assessing the functional role of a 
given candidate gene is to analyze changes in its pattern 
of expression between normal and disease states. Array 
based expression profiling of tumors HF087 and HF1551 
was performed by Fine et. al. [138], and is publicly avail- 
able through the NCBI GEO database [139], accession 
GSE4290. Differential expression analysis carried out 
using the EBarrays algorithm [140] shows that 5 genes 



(NPAS3, STMN2, ZFYVE26, PHLDB2 and PLEKHM3) 
from our list of 24 candidates is significantly up or down- 
regulated (p-value 1E-03 or less). A complete list of differ- 
entially expressed genes can be found in Additional file 8. 
To assay for functional effects in an even larger population 
of tumors, we queried for changes in expression of our 
candidate genes in REMBRANDT, a database of molecular 
data on brain tumors (National Cancer Institute, 2005, 
REMBRANDT home page https://caintegrator.nci.nih. 
gov/rembrandt/, accessed 13 th August 2012). The results 
are reported for each gene as the number of oligodendro- 
glioma samples in the database that are differential expres- 
sion by at least two-fold (Additional file 9). All but 3 of 
our candidate genes were differentially expressed in at 
least 10 tumor samples. Congruent with the previous ana- 
lysis, NPAS3, STMN2, ZFYVE26 and PHLDB2 are the 
most frequently deregulated candidate genes. 

Finally, we asked what biological processes and path- 
ways are significantly enriched or depleted in our list of 
candidates. This can identify fundamental cellular mech- 
anisms that contribute to cancer development. As a 
whole, these candidate genes are enriched for proteins 
involved in cytoskeletal organization (p-value = 0.00223, 
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after correcting for multiple testing, Ontologizer [141]). 
Our candidate genes are also significantly enriched for 
microRNA binding targets (p-values between 0.0144- 
0.0198 after correcting for multiple testing, WebGestalt 
[142]). Approximately half of the over-represented sites 
have been associated with binding of cancer-related 
microRNAs [143], underscoring the importance of post- 
transcriptional control of expression in oligodendroglioma. 

While these results are not a direct indicator of aber- 
rant function, this is a demonstration that Optical Map- 
ping results can be expanded to clinical samples and 
used to create direct functional hypotheses. 

Conclusions 

We have applied Optical Mapping to explore the genomic 
landscape of solid tumor oligodendroglioma. ~2100 dis- 
crete structural variants have been discovered, ranging in 
size from single base changes to loss of entire chromo- 
somes. The structure of each alteration has been elucidated 
at sub-genic resolution, while retaining the long-range con- 
text of the event. 94 somatic mutations have been identi- 
fied, 24 of which affect genes. These novel candidate cancer 
genes provide focused, testable hypotheses for follow-up 
functional investigation. We believe that Optical Mapping 
provides a comprehensive, high-resolution description of 
the complex and disperse genomes of solid tumors. 

Methods 

Selection of tumors 

The tumors used in this study originated from the tissue 
bank at the Hermelin Brain Tumor Center/Department of 
Neurosurgery, Henry Ford Hospital (provided by Dr. Oliver 
Bogler). Freshly resected tumors were snap frozen in liquid 
nitrogen in the operating room. Samples were sectioned 
in a guillotine in frozen condition, and adjacent pieces 
prepared for Optical Mapping and for re-review by a 
neuropathologist. 

The tumor samples selected for Optical Mapping had to 
meet two criteria. First, they needed to conform to the lp/ 
19q paradigm of treatment sensitivity. LOH status was 
assessed by quantitative PCR of microsatellite markers 
along chromosomes lp and 19q (data not shown). Second, 
they needed to have a high proportion of cancer cells as 
opposed to normal cells. The percentage of tumor cells 
present in each biopsy was estimated by MIB-1 antibody 
staining of an adjacent section (data not shown). The 
MIB-1 antibody recognizes the Ki-67 antigen, which is a 
cell proliferation marker. For the most part, mitotic activ- 
ity is absent in the adult brain, so the measurement of the 
Ki-67 cell proliferation marker can be used to judge tumor 
aggression and composition [144,145]. The two samples 
chosen for this project, HF087 and HF1551, satisfied both 
these criteria. Table 3 provides relevant clinical informa- 
tion for each tumor. 



Extraction of high molecular weight DNA from solid 
tumor biopsies 

The tumor was sectioned into 1-2 mm slices under sterile 
conditions in a cell culture hood. Each slice was treated 
with 0.8% type IV collagenase (Sigma-Aldrich, St. Louis, 
MO) in PBS (Phosphate buffered saline, Life Technologies, 
Carlsbad, CA) for 15 minutes at 37°C. The tumor tissue 
was mechanically disaggregated into a homogeneous sus- 
pension by repeated pipetting. The cells were pelleted by 
centrifugation at 1,000 RPM with a Beckman GS-6R cen- 
trifuge (Beckman Instruments, Fullerton, CA), and then 
resuspended in IX HBS (Hanks Balanced Salts, Life Tech- 
nologies, Carlsbad, CA) in order to lyse red blood cells. 
Cell debris and HBS were removed by centrifugation at 
1,000 RPM. Finally, the pellet was rinsed three times with 
35 mL of PBS, and resuspended in 0.5 mL of PBS. 

A three layer Percoll gradient was employed to enrich 
for cancer cells, and minimize stromal contamination 
[146]. First, a 100% solution was made by using 9 parts 
Percoll (Sigma-Aldrich, St. Louis, MO) and 1 part 10X 
HBS, which was subsequendy diluted with PBS to prepare 
10%, 30%, and 50% solutions. The gradient was prepared by 
layering 2 mL of 50% Percoll, 2 mL of 30% Percoll, and 1 
mL of 10% Percoll in a 15 mL Falcon tube. The single cell 
suspension was then carefully layered on top, and the gradi- 
ent was spun at 1,000 RPM for 10 minutes. Studies have 
shown that cellular debris and non-viable cells are unable 
to penetrate the 30% layer, while lymphocytes pelleted at 
the bottom of the tube. The 30% layer, containing viable 
cells, was carefully removed, rinsed three times with 10 mL 
of PBS and then resuspended in PBS at a final concentra- 
tion of 1X10 cells/mL. Next, this cell suspension was 
mixed 1:1 (v/v) with 1.6% low gelling temperature agarose, 
poured into a mold and cooled to 4°C so that the agarose 
solidified to for gel inserts (each ~100 uL in volume). 

The inserts were treated with 0.5 mg/mL proteinase K 
(Bioline USA, Taunton, MA), 100 mM EDTA pH 8.0 
(Sigma-Aldrich, St. Louis, MO), 0.5% N-lauroylsarcosine 
(Sigma-Aldrich, St. Louis, MO) and incubated at 55°C 
overnight to lyse the tumor cells and degrade cellular pro- 
teins [45,147-150]. Embedding cells in agarose inserts 
eliminates shear induced breakage of genomic DNA mole- 
cules upon lysis [151]. 

Prior to use, the gel inserts were rinsed in TE twice for 1 
hour and then a third time overnight to remove the deter- 
gent and excess EDTA. DNA was electrophoretically 
extracted by applying a cycle of 100 V for 30 seconds 
and -100 V for 6 seconds. 

Generation of single molecule optical maps 

Optical Mapping surfaces were prepared as described 
earlier [152]. Briefly, acid-cleaned glass coverslips (22 x 
22 mm, Fisher's Finest, Fisher Scientific) were treated 
with a mixture of N-trimethoxylsilylpropyl-N,N,N- 
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Table 2 Mutation frequencies of candidate cancer genes 
(COSMIC database) 



Gens symbol 


No. of samples 
with mutations 


No. of unique 
samples 


Mutation 
frequency 


STMN2 


3 


517 


0.580270793 


EIF1 


1 


91 


1.098901099 


LRRN2 


10 


522 


1.915708812 


PAX7 


15 


748 


2.005347594 


PRKG1 


21 


545 


3.853211009 


ARHGAP10 


9 


229 


3.930131004 


PARK2 


10 


204 


4.901960784 


0SBPL3 


11 


206 


5.339805825 


APPL1 


9 


161 


5.590062112 


TCEB3 


/ 


97 


7.216494845 


CCDC91 


8 


97 


8.24742268 


EFHC2 


11 


131 


8.396946565 


TACC2 


30 


312 


9.615384615 


NPAS3 


13 


114 


1 1 .40350877 


CERS3/LASS3 


12 


104 


11.53846154 


SIPA1L3 


12 


102 


1 1 .76470588 


DIAPH2 


19 


143 


13.28671329 


PLEKHM3 


4 


26 


15.38461538 


PHLDB2 


17 


108 


15.74074074 


ALMS1 


28 


165 


1 6.96969697 


ZFYVE26 


35 


187 


18.71657754 


CECR2 


16 


85 


18.82352941 


MYOF 


22 


115 


19.13043478 


LOC339166 


0 


0 





trimethylammonium chloride and vinyltrimethoxysilane 
(Gelest, Morrisville, PA) rendering a positive charge to 
the surface. Genomic DNA, mixed with a sizing stand- 
ard, was elongated via capillary flow in a microfluidic 
device [66], and immobilized by electrostatic interac- 
tions with the positively charged surface, creating arrays 
of stretched, biochemically accessible substrates. The 
surface was then washed with TE (10 mM Tris-HCl, 1 
mM EDTA, pH8.0) twice, equilibrated with digestion 
buffer (NEB buffer 3), then incubated with the restric- 
tion endonuclease Swal (New England Biolabs, Beverly, 
MA), which cleaves the genomic DNA at its cognate 
site. Since the elongated DNA molecule is under slight 
tension, upon cleavage its ends relax, creating a 1-2 mi- 
cron gap, readily detected by microscopy. The resulting 
restriction fragments remain adsorbed to the surface, 
aided by a polyacrylamide overlay, and hence retain their 
order creating, in essence, a barcode from each genomic 
DNA molecule. Restriction fragments were then stained 
with the DNA intercalating dye YOYO-1 (0.2 uM in [3- 



mercaptoethanol/TE, Life Technologies, Carlsbad, CA) 
and imaged by automated fluorescence microscopy. 

The images were collected on an Optical Mapping 
workstation, which consists of Zeiss 135M inverted micro- 
scope (Carl Zeiss, Thornwood, NY), illuminated by 488 
nm argon ion laser (Spectra Physics, Santa Clara, CA) 
equipped with 63X oil immersion objective. Fully auto- 
mated image acquisition software, referred to as Channel 
Collect [66], takes multiple overlapping images to span the 
entire length of each microchannel. The images were ana- 
lyzed by custom machine vision software, called Path- 
finder, which identifies DNA molecules on the surface and 
calculates the size of each restriction fragment based on 
integrated fluorescence intensity measurements relative to 
a sizing standard. Previous studies have shown that inte- 
grated fluorescence intensity scales with fragment mass, 
and is independent of stretch of the DNA molecule. The 
end result of these operations is the high throughput, mas- 
sively parallel generation of single molecule ordered re- 
striction maps, or optical maps, containing information 
about both the size and order of its restriction fragments. 

Pipeline for optical Map assembly and identification of 
structural variants 

The analytical framework for assembly of optical maps is 
analogous to sequence assembly. First, our pipeline auto- 
matically aligned optical maps against a Swal restriction 
map created in silico from the human genome reference se- 
quence (NCBI build 35) via SOMA (Software for Optical 
Map Alignment) using gapped global pair wise alignment 
[56,67]. SOMA uses a scoring function that assigns penal- 
ties for differences in the optical map and the reference 
map, including missing or extra restriction sites, or differ- 
ences in the size of the fragments that could represent in- 
sertions or deletions. The parameters of SOMA were set so 
that we are accurately aligning the molecule to the correct 
location, but loose enough for allow for a small number of 
differences that result from the mutations or polymor- 
phisms present in the genome. The aligned maps were then 
partitioned into smaller bins (1 Mb windows spanning 
across each chromosome, with 500 kb overlap between ad- 
jacent windows) based on their location. The optical maps 
in each bin were assembled into optical consensus maps by 
a map assembler program, using a Bayesian inference algo- 
rithm [153]. Because some structural polymorphisms and 
mutations represent large-scale changes from the reference 
map, an iterative assembly process was used for the analysis 
of human data sets. The consensus map constructed in the 
previous step was used in place of the reference for seven 
more iterations of alignment and assembly, after which it 
was aligned to the reference sequence using SOMA. Using 
this strategy, Rmaps harboring major alterations that 
preclude alignment to the reference were gradually 
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Table 3 Clinical information on the tumors analyzed by Optical Mapping 

Tumor sample Histology Histology code LOH status MIB Index (%) Patient age Patient sex 



HF087 Oligodendroglioma 0 II 

HF1551 Atypical Oligodendroglioma Oil 

O //: Grade II; LOH: loss of heterozygosity. 

incorporated into the consensus map, extending it into re- 
gions that contain more complex rearrangements. 

Lastly, the pipeline automatically performed analysis that 
tabulated structural variants using the final consensus map 
to the reference (derived from NCBI build 35 of the human 
genome [56]) and identified five classes of differences: 
missing cuts, extra cuts, insertions, deletions, and 'other' 
(multiple cut and/or size differences) across each cancer 
genome. Each of these differences, which are largely struc- 
tural variants, has to satisfy certain statistical and empirical 
criteria. These parameters have been detailed in Teague 
et al. [56]. The only difference being the indel calling 
threshold, which was increased to a 13% change relative 
to the reference, with a 4.5 kb minimum. 

Additionally, each structural variant was manually cu- 
rated to ensure that the most conservative decision has 
been made at every locus. The genomic locations of the 
variants were converted to NCBI build 37 co-ordinates 
using the Batch Coordinate Conversion (liftover) tool 
from the University of California Santa Cruz Genome 
Browser [154] (http://genome.ucsc.edu). 

Optical map coverage analysis 

Variations in depth of coverage of optical maps aligned by 
SOMA across the genome can be used to detect copy 
number alterations. Intuitively, if a region of the tumor 
sample has increased (or decreased) copy number relative 
to the 'normal' reference genome, more (or less) maps will 
originate from it on an average. This is formalized as de- 
scribed. Pair-wise alignments of optical maps to an in silico 
reference were summarized by a single number (midpoint) 
representing location. These locations were modeled as re- 
alizations of a non-homogeneous Poisson process. The 
non-homogeneity arises from the fact that the likelihood of 
a map aligning to a genomic region depends on the density 
of restriction sites, and was accounted for using alignment 
data from a normal genome, which are used to define ran- 
dom intervals with counts that follow a negative binomial 
distribution. These counts were then modeled by a Hidden 
Markov Model, incorporating spatial dependence in the 
data and allowing more natural estimation of certain pa- 
rameters [68,69]. 

Affymetrix genome wide human SNP array 6.0 

DNA was prepared for hybridization using the Blood 
and Cell Culture Kit (Qiagen, Valencia, CA), starting 
from frozen cells (HF087), or tumor tissue (HF1551), 
disaggregated into single cells as described previously. 



LOH 5-7% 65 Female 

LOH 4-13% 30 Female 



The HF087 cells were derived from the same slice used 
for Optical Mapping. However, since the same was not 
available for HF1551, a slice adjacent to the one used for 
mapping was used. 

The DNA was digested with Nspl and Styl restriction 
enzymes and ligated to adaptors that recognize the 4 bp 
overhangs. A generic primer that anneals to the adaptor 
sequence was then used to amplify adaptor-ligated DNA 
fragments, under PCR conditions optimized to preferen- 
tially amplify fragments in the 200 to 1,100 bp size range. 
The amplified DNA was then fragmented, labelled, and 
hybridized to a Genome-Wide Human SNP 6.0 Array (ex- 
periments were performed by the DNA Facility at the 
Carver College of Medicine, University of Iowa). Data ana- 
lysis was performed using Genotyping Console 2.0 
(Affymetrix, Santa Clara, CA). CNVs were called using ei- 
ther the Affymetrix algorithm (with default parameters) 
or five different algorithms (GLAD, Circular Binary Seg- 
mentation, Fused Lasso, Gaussian Model with Adaptive 
Penalty, Forward-Backward Fragment Annealing Segmen- 
tation) from CGHweb (http://compbio.med.harvard.edu/ 
CGHweb/) [155]. Only CNV calls made by two or more 
algorithms were considered for comparison. 

Parameters for comparing oligodendroglioma structural 
variants 

To other optical mapping datasets 

Only variants of the same type were compared to each 
other, e.g.: MCs from HF087 were compared to MCs from 
lymphoblast cell line GM15510. Intersection 'windows' were 
set based on the type of OSA (100 bp for MCs, 4200 bp for 
ECs and 0 bp for INS, DEL and OTHER) and are reflective 
of the error processes inherent to each type of event. 

To published SNPs and structural variants 

Published SNPs were compared against Optical Mapping 
cut differences using 100 bp or 3000 bp windows for 
MCs and ECs, respectively. 

Structural variants from the latest (November 2010) 
release of the Database of Genomic Variants [156] were 
divided into two categories on the basis of their size. 
Events smaller than 3 kb were compared to ECs and 
MCs, since ~l/3 rd of indels that are below the lower 
limit of detection for Optical Mapping manifest them- 
selves as cut differences [56]. Events larger than 3 kb 
were compared to INS, DEL and OTHER variants using 
a 0 bp intersection window. 
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Additional file 3: Detailed description of intersections between 
oligodendroglioma structural variants and genes, SNPs, variants 
from the Database of Genomic Variants and other normal human 
optical maps. This spreadsheet provides a detailed breakdown of the 
overlap between Each row in the spreadsheet shows an optical map 
difference (column A), it's location (columns B-D), and genes (column E), 
variants from the Database of Genomic Variants (column F, G), snip-SNPs 
(column H), and structural variants from other Optical Mapping datasets 
(columns l-N) that overlap with it. The number is parenthesis after each 
column header indicates the intersection window. 

Additional file 4: Experimental validation of oligodendroglioma ECs 

and MCs by SNP array. This table lists cut differences found in HF087 
and HF1551 (column A), their location (columns B-D), type (column E), 
the SNP genotype corresponding to it (column F) and whether it agrees 
with the optical map (column G). 

Additional file 5: Experimental validation of oligodendroglioma 
indels. This table lists indels found in HF087 and HF1551 (column A), 
their location (columns B-D) and whether it is validated by a given copy 
number algorithm (columns E-J). 

Additional file 6: Oligodendroglioma structural variants and their 
intersection (counts) with variants detected in six other normal 
human optical maps. This table lists counts of structural variants from 
oligodendroglioma that intersect with variants found in other normal human 
genomes that have been analyzed by Optical Mapping. Column A specifies 
the genomic element; columns B and H indicate total counts for HF087 and 
HF1551 respectively; Columns C-G and l-M shows overlap counts by variant 
class. Only variants of the same category are included in the comparison. 

Additional file 7: Non-genic candidates found in 
oligodendroglioma and functional elements from ENCODE that 
intersect them. Column A lists identifiers for candidate loci that do not 
occur within a gene, their locations (columns B-D), overlapping 
transcripts found by GENCODE (column E), and predicted genomic state 
in different cell types (columns F-N). The cell types in red font are cancer 
cell lines. Different genomic states are color-coded as per ENCODE 
website, and is detailed in the key. 

Additional file 8: Differentially expressed genes in GEO dataset 
GSE4290, analyzed by Ebarrays, p = 1 E-03. Entrez gene identifer 
(column A), gene symbol (column B), gene name (column C), cellular 
location (column D) and molecular function (column E) of all differentially 
expressed genes in GEO dataset GSE4290. 

Additional file 9: Number of oligodendrogioma samples in 
REMBRANDT database where a given candidate gene is up or down 
regulated by at least 2 fold. This table lists candidate genes identified 
through Optical Mapping (column A), and the number of 
oligodendroglioma samples in the REMBRANDT database where that 
gene is up or down regulated by at least two fold (column B). 



PCR validation 

Template for PCR was prepared by whole genome amplifi- 
cation of tumor DNA using the REPLI-g Mini kit (Qiagen 
Inc., Valencia, CA) as per the protocol provided by the 
manufacturer. Pooled normal DNA from 6 individuals 
(Promega Corporation, Madison, WI) was used for control 
reactions. Primers were designed using freely available soft- 
ware Primer 3 Plus [157]. PCR reactions were performed 
using reagents from the Expand Long Template PCR Sys- 
tem (Roche Applied Science, Indianapolis, IN) following 
the protocol supplied by the manufacturer. PCR reactions 
were digested with appropriate restriction enzymes to es- 
tablish that the correct region had been amplified. The 
amplicon was then cloned in E. coli using the TOPO TA 
Cloning Kit (Invitrogen, Carlsbad, CA), plasmid DNA was 
purified using the Qiagen Plasmid Mini Kit (Qiagen Inc., 
Valencia, CA), and sequenced using Sanger biochemistry. 

Targeted assemblies on Williams-Beuren 
chromosomal region 

The Swal in-silico restriction map from the Williams-Beuren 
region on chromosome 14 was modified to reflect one of 
eight alterations: 4 possible inversions, each with unique 
start/end locations and spans (including the 'canonical' inver- 
sion), and 4 possible deletions, each with unique start/end 
locations and sizes (including the canonical' deletion). These 
modified in-silico maps were subjected to 8 rounds of itera- 
tive assembly, using the collection of HF1551 Rmaps, with 
the same parameters as the genome-wide assembly. The 
results were manually curated to rule out assembly errors. 
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Additional file 1 : Basic attributes of oligodendroglioma datasets. This 
spreadsheet describes the two optical maps in detail. Column A lists specific 
characteristics associated with each step of constructing the map. Columns 
B and C lists its values for tumor HF087 (column B) and HF1551 (column C). 

Additional file 2: Overlap (counts) between oligodendroglioma 
structural variants and genes, segmental duplications, SNPs, and 
variants reported by other investigators. This table lists counts of 
structural variants from oligodendroglioma that intersect with genes, 
segmental duplications, SNPs, and events from the Database of Genomic 
Variants (DGV). Column A specifies the genomic element; columns B and H 
indicate total counts for HF087 and HF1551 respectively; Columns C-G and 
l-M shows overlap counts by variant class. Variants from the DGV are divided 
based on size into those over 3 kb and those under then 3 kb, then further 
by study. Variants less than 3 kb are compared to optical map EC and MCs, 
while those over 3 kb are compared to INS, DEL and OTHER. 



Ray ef al. BMC Genomics 2013, 14:505 
http://www.biomedcentral.com/1471 -21 64/1 4/505 



Page 18 of 21 



Acknowledgements 

We thank Nick Shera for his early efforts during the data acquisition phase of 
this project and Ezra Lyon for his bioinformatic support. We also thank NHGRI 
and NCI for support (D.C.S.; R01- HG000225; R33CA1 1 1933); M.R would also like 
to acknowledge the Morgridge Biotechnology Fellowship for funding. 

Author details 

'Laboratory for Molecular and Computational Genomics, Department of 
Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of 
Wisconsin-Madison, Madison, Wl 53706, USA. theoretical Statistics and 
Mathematics Unit, Indian Statistical Institute, Delhi, India, department of 
Biostatistics and Medical Informatics, University of Wisconsin-Madison, 
Madison, Wl 53706, USA. department of Statistics, University of Wisconsin- 
Madison, Madison, Wl 53706, USA. 5 M.D. Anderson Cancer Center, University 
of Texas, Houston, TX 77030, USA. 

Received: 7 March 2013 Accepted: 23 July 2013 
Published: 26 July 2013 

References 

1 . Slamon DJ, Clark GM, Wong SG, Levin WJ, Ullrich A, McGuire WL: Human 
breast cancer: correlation of relapse and survival with amplification of 
the HER-2/neu oncogene. Science 1987, 235(4785):1 77-1 82. 

2. Bos JL: Ras oncogenes in human cancer: a review. Cancer Res 1989, 
49(1 7):4682-4689. 

3. Murphree AL, Benedict WF: Retinoblastoma: clues to human oncogenesis. 
Science 1984, 223(4640):! 028-1 033. 

4. Orlow I, Lacombe L, Hannon GJ, Serrano M, Pellicer I, Dalbagni G, Reuter VE, 
Zhang ZF, Beach D, Cordon-Cardo C: Deletion of the pi 6 and pi 5 genes 
in human bladder tumors. J Natl Cancer Inst 1995, 87(20):1 524-1 529. 

5. ar-Rushdi A, Nishikura K, Erikson J, Watt R, Rovera G, Croce CM: Differential 
expression of the translocated and the untranslocated c-myc oncogene 
in Burkitt lymphoma. Science 1983, 222(4622)390-393. 

6. Shtivelman E, Lifshitz B, Gale RP, Canaani E: Fused transcript of abl and bcr 
genes in chronic myelogenous leukaemia. Nature 1985, 315(6020)550-554 

7. Albertson DG, Collins C, McCormick F, Gray JW: Chromosome aberrations 
in solid tumors. Nat Genet 2003, 34(4)369-376. 

8. Farabegoli F, Santini D, Ceccarelli C, Taffurelli M, Marrano D, Baldini N: Clone 
heterogeneity in diploid and aneuploid breast carcinomas as detected 
by FISH. Cytometry 2001, 46(1)50-56. 

9. Marusyk A, Polyak K: Tumor heterogeneity: causes and consequences. 
Blochim BiophysActa 1805, 1:105-117. 

10. Navin N, Krasnitz A, Rodgers L, Cook K, Meth J, Kendall J, Riggs M, Eberling 
Y, Troge J, Grubor V, ef al: Inferring tumor progression from genomic 
heterogeneity. Genome Res 2010, 20(1):68-80. 

1 1. Zhang FF, Murata-Collins JL, Gaytan P, Forman SJ, Kopecky KJ, Willman CL, 
Appelbaum FR, Slovak ML: Twenty-four-color spectral karyotyping reveals 
chromosome aberrations in cytogenetically normal acute myeloid 
leukemia. Genes Chromosomes Cancer 2000, 28(3)318-328. 

1 2. Hicks DG, Tubbs RR: Assessment of the HER2 status in breast cancer by 
fluorescence in situ hybridization: a technical review with interpretive 
guidelines. Hum Pathol 2005, 36(3)250-261 . 

1 3. Pinkel D, Segraves R, Sudar D, Clark S, Poole I, Kowbel D, Collins C, Kuo WL, 
Chen C, Zhai Y, ef al: High resolution analysis of DNA copy number 
variation using comparative genomic hybridization to microarrays. 

Nat Genet 1 998, 20(2)207-21 1 . 

1 4. Ishkanian AS, Malloff CA, Watson SK, DeLeeuw RJ, Chi B, Coe BP, Snijders A, 
Albertson DG, Pinkel D, Marra MA, ef al: A tiling resolution DNA microarray with 
complete coverage of the human genome. Nat Genet 2004, 36(3)299-303. 

15. Kallioniemi A, Kallioniemi OP, Sudar D, Rutovitz D, Gray JW, Waldman F, 
Pinkel D: Comparative genomic hybridization for molecular cytogenetic 
analysis of solid tumors. Science 1992, 258(5083):81 8-821. 

16. Pierotti MA, Santoro M, Jenkins RB, Sozzi G, Bongarzone I, Grieco M, Monzini 
N, Miozzo M, Herrmann MA, Fusco A, ef al: Characterization of an 
inversion on the long arm of chromosome 10 juxtaposing D10S170 and 
RET and creating the oncogenic sequence RET/PTC. Proc Natl Acad Sci 
USA 1992, 89(5):1616-1620. 

1 7. Soda M, Choi YL, Enomoto M, Takada S, Yamashita Y, Ishikawa S, Fujiwara S, 
Watanabe H, Kurashina K, Hatanaka H, ef al: Identification of the 
transforming EML4-ALK fusion gene in non-small-cell lung cancer. 
Nature 2007, 448(7153)561-566. 



18. Tomlins SA, Rhodes DR, Perner S, Dhanasekaran SM, Mehra R, Sun XW, 
Varambally S, Cao X, Tchinda J, Kuefer R, ef al: Recurrent fusion of 
TMPRSS2 and ETS transcription factor genes in prostate cancer. 
Science 2005, 310(5748):644-648. 

1 9. Hajirasouliha I, Hormozdiari F, Alkan C, Kidd JM, Birol I, Eichler EE, Sahinalp SC: 
Detection and characterization of novel sequence insertions using 
paired-end next-generation sequencing. Bioinformatics 2010, 26(1 0):1 277—1283. 

20. Liu G, Gramling S, Munoz D, Cheng D, Azad AK, Mirshams M, Chen Z, Xu W, 
Roberts H, Shepherd FA, ef al: Two novel BRM insertion promoter 
sequence variants are associated with loss of BRM expression and lung 
cancer risk. Oncogene 201 1, 30(29)3295-3304. 

21. Bailey JA, Eichler EE: Primate segmental duplications: crucibles of 
evolution, diversity and disease. Nat Rev Genet 2006, 7(7):552-564. 

22. Mefford HC, Eichler EE: Duplication hotspots, rare genomic disorders, and 
common disease. Curr Opin Genet Dev 2009, 19(3): 196-204. 

23. Alkan C, Coe BP, Eichler EE: Genome structural variation discovery and 
genotyping. Nat Rev Genet 201 1, 12(5)363-376. 

24. Wood LD, Parsons DW, Jones S, Lin J, Sjoblom T, Leary RJ, Shen D, Boca SM, 
Barber T, Ptak J, ef al: The genomic landscapes of human breast and 
colorectal cancers. Science 2007, 31 8(5853):! 1 08-1 113. 

25. Stephens P, Edkins S, Davies H, Greenman C, Cox C, Hunter C, Bignell G, 
Teague J, Smith R, Stevens C, ef al: A screen of the complete protein 
kinase gene family identifies diverse patterns of somatic mutations in 
human breast cancer. Nat Genet 2005, 37(6)590-592. 

26. Bardelli A, Parsons DW, Silliman N, Ptak J, Szabo S, Saha S, Markowitz S, 
Willson JK, Parmigiani G, Kinzler KW, ef al: Mutational analysis of the 
tyrosine kinome in colorectal cancers. Science 2003, 300(5621 ):949. 

27. Paez JG, Janne PA, Lee JC, Tracy S, Greulich H, Gabriel S, Herman P, Kaye FJ, 
Lindeman N, Boggon TJ, ef al: EGFR mutations in lung cancer: correlation with 
clinical response to gefitinib therapy. Science 2004, 304(5676):1497-1500. 

28. Shah SP, Roth A, Goya R, Oloumi A, Ha G, Zhao Y, Turashvili G, Ding J, Tse K, 
Haffari G, ef al: The clonal and mutational evolution spectrum of primary 
triple-negative breast cancers. Nature 2012, 486(7403)395-399. 

29. Sjoblom T, Jones S, Wood LD, Parsons DW, Lin J, Barber TD, Mandelker D, 
Leary RJ, Ptak J, Silliman N, ef al: The consensus coding sequences of 
human breast and colorectal cancers. Science 2006, 314(5797)268-274 

30. Bettegowda C, Agrawal N, Jiao Y, Sausen M, Wood LD, Hruban RH, 
Rodriguez FJ, Cahill DP, McLendon R, Riggins G, ef al: Mutations in CIC and 
FUBP1 contribute to human oligodendroglioma. Science 201 1, 
333(6048):1453-1455. 

31. Agrawal N, Frederick MJ, Pickering CR, Bettegowda C, Chang K, Li RJ, Fakhry 
C, Xie TX, Zhang J, Wang J, ef al: Exome sequencing of head and neck 
squamous cell carcinoma reveals inactivating mutations in NOTCH1. 
Science 201 1, 333(6046)4 154-1 157. 

32. Network CGAR: Integrated genomic analyses of ovarian carcinoma. 
Nature 201 1 , 474(7353):609-61 5. 

33. Chapman MA, Lawrence MS, Keats JJ, Cibulskis K, Sougnez C, Schinzel AC, 
Harview CL, Brunet JP, Ahmann GJ, Adli M, ef al: Initial genome sequencing 
and analysis of multiple myeloma. Nature 201 1, 471 (7339):467-472. 

34. Ding L, Ley TJ, Larson DE, Miller CA, Koboldt DC, Welch JS, Ritchey JK, 
Young MA, Lamprecht T, McLellan MD, ef al: Clonal evolution in relapsed 
acute myeloid leukaemia revealed by whole-genome sequencing. 
Nature 201 2, 481 (7382)506-5 1 0. 

35. Ding L, Ellis MJ, Li S, Larson DE, Chen K, Wallis JW, Harris CC, McLellan MD, 
Fulton RS, Fulton LL, ef al: Genome remodelling in a basal-like breast 
cancer metastasis and xenograft. Nature 2010, 464(7291)399-1005. 

36. Lee W, Jiang Z, Liu J, Haverty PM, Guan Y, Stinson i, Yue P, Zhang Y, Pant 
KP, Bhatt D, ef al: The mutation spectrum revealed by paired genome 
sequences from a lung cancer patient. Nature 2010, 465(7297)473-477. 

37. Ley TJ, Mardis ER, Ding L, Fulton B, McLellan MD, Chen K, Dooling D, 
Dunford-Shore BH, McGrath S, Hickenbotham M, ef al: DNA sequencing of 
a cytogenetically normal acute myeloid leukaemia genome. Nofure 2008, 
456(721 8):66-72. 

38. Pleasance ED, Cheetham RK, Stephens PJ, McBride DJ, Humphray SJ, 
Greenman CD, Varela I, Lin ML, Ordonez GR, Bignell GR, ef al: A 
comprehensive catalogue of somatic mutations from a human cancer 
genome. Nature 2010, 463(7278)4 91. 

39. Pleasance ED, Stephens PJ, O'Meara S, McBride DJ, Meynert A, Jones D, Lin 
ML, Beare D, Lau KW, Greenman C, ef al: A small-cell lung cancer genome 
with complex signatures of tobacco exposure. Nature 2010, 
463(7278):! 84-1 90. 



Ray ef al. BMC Genomics 2013, 14:505 
http://www.biomedcentral.com/1471 -21 64/1 4/505 



Page 19 of 21 



40. Shah SP, Morin RD, Khattra J, Prentice L, Pugh T, Burleigh A, Delaney A, Gelmon 
K, Guliany R, Senz J, ef al: Mutational evolution in a lobular breast tumour 
profiled at single nucleotide resolution. Nature 2009, 461 (726S):809-81 3. 

41. Mills RE, Walter K, Stewart C, Handsaker RE, Chen K, Alkan C, Abyzov A, 
Yoon SC, Ye K, Cheetham RK, ef al: Mapping copy number variation by 
population-scale genome sequencing. Nofure 201 1, 470(7332):59-65. 

42. Alkan C, Sajjadian S, Eichler EE: Limitations of next-generation genome 
sequence assembly. Nat Methods 201 1, 8(l):61-65. 

43. Consortium GP: A map of human genome variation from 
population-scale sequencing. Nature 2010, 467(731 9):1 061 -1 073. 

44. Medvedev P, Stanciu M, Brudno M: Computational methods for 
discovering structural variation with next-generation sequencing. 
Nat Methods 2009, 6(1 1 Suppl):S13-S20. 

45. Reslewic S, Zhou S, Place M, Zhang Y, Briska A, Goldstein S, Churas C, 
Runnheim R, Forrest D, Lim A, ef al: Whole-genome shotgun Optical 
Mapping of Rhodospirillum rubrum. Appl Environ Microbiol 2005, 
71(9)5511-5522. 

46. Lai Z, Jing J, Aston C, Clarke V, Apodaca J, Dimalanta ET, Carucci DJ, Gardner 
MJ, Mishra B, Anantharaman TS, ef al: A shotgun optical map of the entire 
Plasmodium falciparum genome. Nat Genet 1999, 23(3)309-313. 

47. Lin J, Qi R, Aston C, Jing J, Anantharaman TS, Mishra B, White O, Daly MJ, 
Minton KW, Venter JC, ef al: Whole-genome shotgun Optical Mapping of 
Deinococcus radiodurans. Science 1999, 285(5433):! 558-1 562. 

48. Schwartz DC, Li X, Hernandez LI, Ramnarain SP, Huff EJ, Wang YK: Ordered 
restriction maps of Saccharomyces cerevisiae chromosomes constructed 
by Optical Mapping. Science 1993, 262(5130):! 10-1 14. 

49. Young ND, Debelle F, Oldroyd GE, Geurts R, Cannon SB, Udvardi MK, 
Benedito VA, Mayer KF, Gouzy J, Schoof H, ef al: The Medicago genome 
provides insight into the evolution of rhizobial symbioses. Nature 201 1, 
480(7378):520-524. 

50. Zhou S, Wei F, Nguyen J, Bechner M, Potamousis K, Goldstein S, Pape L, 
Mehan MR, Churas C, Pasternak S, ef al: A single molecule scaffold for the 
maize genome. PLoS Genet 2009, 5(1 1 ):el 00071 1. 

51. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang 
J, Fulton L, Graves TA, ef al: The B73 maize genome: complexity, diversity, 
and dynamics. Science 2009, 326(5956):! 112-1115. 

52. Zhou S, Bechner MC, Place M, Churas CP, Pape L, Leong SA, Runnheim R, 
Forrest DK, Goldstein S, Livny M, ef al: Validation of rice genome sequence 
by Optical Mapping. BMC Genomics 2007, 8:278. 

53. Kidd JM, Cooper GM, Donahue WF, Hayden HS, Sampas N, Graves T, Hansen N, 
Teague B, Alkan C, Antonacci F, ef al: Mapping and sequencing of structural 
variation from eight human genomes. Nature 2008, 453(7191)56-64. 

54. Zody MC, Garber M, Adams DJ, Sharpe T, Harrow J, Lupski JR, Nicholson C, 
Searle SM, Wilming L, Young SK, ef al: DNA sequence of human 
chromosome 17 and analysis of rearrangement in the human lineage. 
Nature 2006, 440(7087):! 045-1 049. 

55. Ananiev GE, Goldstein S, Runnheim R, Forrest DK, Zhou S, Potamousis K, 
Churas CP, Bergendahl V, Thomson JA, Schwartz DC: Optical Mapping 
discerns genome wide DNA methylation profiles. BMC Mol Biol 2008, 9:68. 

56. Teague B, Waterman MS, Goldstein S, Potamousis K, Zhou S, Reslewic S, 
Sarkar D, Valouev A, Churas C, Kidd JM, ef al: High-resolution human 
genome structure by single-molecule analysis. Proc Natl Acad Sci USA 
2010, 107(24): 1 0848-1 0853. 

57. Church DM, Goodstadt L, Hillier LW, Zody MC, Goldstein S, She X, Bult CJ, 
Agarwala R, Cherry JL, DiCuccio M, ef al: Lineage-specific biology revealed by a 
finished genome assembly of the mouse. PLoSBiol 2009, 7(5):e10001 12 

58. DeAngelis LM: Brain tumors. N Engl J Med 2001, 344(2): 1 14-1 23. 

59. Reifenberger G, Louis DN: Oligodendroglioma: toward molecular 
definitions in diagnostic neuro-oncology. J Neuropathol Exp Neurol 2003, 
62(2):1 11-126. 

60. Cairncross G, Jenkins R: Gliomas with 1p/19q codeletion: a.k.a. 
oligodendroglioma. Cancer J 2008, 14(6)352-357. 

61. Cairncross JG, Ueki K, Zlatescu MC, Lisle DK, Finkelstein DM, Hammond RR, 
Silver JS, Stark PC, Macdonald DR, Ino Y, ef al: Specific genetic predictors 
of chemotherapeutic response and survival in patients with anaplastic 
oligodendrogliomas. J Natl Cancer Inst 1998, 90(19):1473-1479. 

62. Walker C, du Plessis DG, Joyce KA, Fildes D, Gee A, Haylock B, Husband D, Smith 
T, Broome J, Warnke PC: Molecular pathology and clinical characteristics of 
oligodendroglial neoplasms. Ann Neurol 2005, 57(6):855-865. 

63. Bromberg JE, van den Bent MJ: Oligodendrogliomas: molecular biology 
and treatment. Oncologist 2009, 1 4(2):1 55- 1 63. 



64. Wharton SB, Maltby E, Jellinek DA, Levy D, Atkey N, Hibberd S, Crimmins D, 
Stoeber K Williams GH: Subtypes of oligodendroglioma defined by 1 p,19q 
deletions, differ in the proportion of apoptotic cells but not in replication- 
licensed non-proliferating cells. Acta Neuropathol 2007, 1 1 3(2):1 1 9-1 27. 

65. van den Bent MJ, Looijenga LH, Langenberg K Dinjens W, Graveland W, 
Uytdewilligen L, Sillevis Smitt PA, Jenkins RB, Kros JM: Chromosomal anomalies 
in oligodendroglial tumors are correlated with clinical features. Cancer 2003, 
97(5)1276-1284. 

66. Dimalanta ET, Lim A, Runnheim R, Lamers C, Churas C, Forrest DK, de Pablo 
JJ, Graham MD, Coppersmith SN, Goldstein S, ef al: A microfluidic system 
for large DNA molecule arrays. Anal Chem 2004, 76(18)5293-5301 . 

67. Valouev A, Li L, Liu YC, Schwartz DC, Yang Y, Zhang Y, Waterman MS: 
Alignment of optical maps. J Comput Biol 2006, 13(2)442-462. 

68. Sarkar D: On the analysis of Optical Mapping data. Madison, Wl: University of 
Wisconsin: 2006. 

69. Sarkar D, Goldstein S, Schwartz DC, Newton MA: Statistical significance of 
optical map alignments. J Comput Biol 2012, 19(5):478-492. 

70. Griffin CA, Burger P, Morsberger L, Yonescu R, Swierczynski S, Weingart JD, 
Murphy KM: Identification of der(1;19)(q10;p10) in five 
oligodendrogliomas suggests mechanism of concurrent 1p and 19q loss. 
J Neuropathol Exp Neurol 2006, 65(10):988-994. 

71. Jenkins RB, Blair H, Ballman KV, Giannini C, Arusell RM, Law M, Flynn H, 
Passe S, Felten S, Brown PD, ef al: A t(1;19)(q10;pl0) mediates the 
combined deletions of 1p and 19q and predicts a better prognosis of 
patients with oligodendroglioma. Cancer Res 2006, 66(20)3852-9861. 

72. Anderson K Lutz C, van Delft FW, Bateman CM, Guo Y, Colman SM, Kempski H, 
Moorman AV, Ttley I, Swansbury J, ef al: Genetic variegation of clonal architecture 
and propagating cells in leukaemia. Nature 201 1, 469(7330)356-361. 

73. Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, 
Martinez P, Matthews N, Stewart A, Tarpey P, ef al: Intratumor 
heterogeneity and branched evolution revealed by multiregion 
sequencing. N Engl J Med 201 2, 366(1 0):883-892. 

74. Yap TA, Gerlinger M, Futreal PA, Pusztai L, Swanton C: Intratumor heterogeneity: 
seeing the wood for the trees. Sci Transl Med 201 2, 4(1 27):1 27-1 1 0. 

75. Wicks SR, Yeh RT, Gish WR, Waterston RH, Plasterk RH: Rapid gene mapping 
in Caenorhabditis elegans using a high density polymorphism map. 

Nat Genet 2001, 28(2):1 60-164. 

76. Macintyre G, Alford T Xiong L, Rouleau GA, Tibbo PG, Cox DW: Association of 
NPAS3 exonic variation with schizophrenia. SchizophrRes 2010, 120(1 -3): 143- 149. 

77. Pickard BS, Christoforou A, Thomson PA, Fawkes A, Evans KL, Morris SW, Porteous 
DJ, Blackwood DH, Muir WJ: Interacting haplotypes at the NPAS3 locus alter 
risk of schizophrenia and bipolar disorder. Mol Psychiatry 2009, 14(9):874-884. 

78. Moreira F, Kiehl TR, So K, Ajeawung NF, Honculada C, Gould P, Pieper RO, 
Kamnasaran D: NPAS3 demonstrates features of a tumor suppressive role in 
driving the progression of Astrocytomas. Am J Pathol 2011, 179(1)462-476. 

79. Lehto M, Olkkonen VM: The OSBP-related proteins: a novel protein family 
involved in vesicle transport, cellular lipid metabolism, and cell 
signalling. Biochim Biophys Acta 2003, 1631(1):1-11. 

80. Lehto M, Tienari J, Lehtonen S, Lehtonen E, Olkkonen VM: Subfamily III of 
mammalian oxysterol-binding protein (OSBP) homologues: the expression and 
intracellular localization of ORP3, ORP6, and ORP7. Cell Tissue Res 20O4, 315(1)39-57. 

81. Lehto M, Mayranpaa Ml, Pellinen T, lhalmo P, Lehtonen S, Kovanen PT, 
Groop PH, Ivaska J, Olkkonen VM: The R-Ras interaction partner ORP3 
regulates cell adhesion. J Cell Sci 2008, 121(Pt 5):695-705. 

82. Sander B, Flygare J, Porwit-Macdonald A, Smith CI, Emanuelsson E, Kimby E, 
Liden J, Christensson B: Mantle cell lymphomas with low levels of cyclin D1 
long mRNA transcripts are highly proliferative and can be discriminated by 
elevated cyclin A2 and cyclin B1. Int J Cancer 2005, 1 1 7(3)41 8-430. 

83. Ek S, Andreasson U, Hober S, Kampf C, Ponten F, Uhlen M, Merz H, 
Borrebaeck CA: From gene expression analysis to tissue microarrays: 
a rational approach to identify therapeutic and diagnostic targets in 
lymphoid malignancies. Mol Cell Proteomics 2006, 5(6):1 072-1 081. 

84. Bijl J, Sauvageau M, Thompson A, Sauvageau G: High incidence of proviral 
integrations in the Hoxa locus in a new model of E2a-PBX1 -induced 
B-cell leukemia. Genes Dev 2005, 19(2)224-233. 

85. Abbas N, Lucking CB, Ricard S, Durr A, Bonifati V, De Michele G, Bouley S, 
Vaughan JR, Gasser T, Marconi R, ef al: A wide variety of mutations in the 
parkin gene are responsible for autosomal recessive parkinsonism in 
Europe. French Parkinson's Disease Genetics Study Group and the 
European Consortium on Genetic Susceptibility in Parkinson's Disease. 
Hum Mol Genet 1999, 8(4)567-574. 



Ray ef al. BMC Genomics 2013, 14:505 
http://www.biomedcentral.com/1471 -21 64/1 4/505 



86. Kitada T, Asakawa S, Hattori N, Matsumine H, Yamamura Y, Minoshima 5, 
Yokochi M, Mizuno Y, Shimizu N: Mutations in the parkin gene cause 
autosomal recessive juvenile parkinsonism. Nature 1998, 392(6676):605-608. 

87. Lucking CB, Durr A, Bonifati V, Vaughan J, De Michele G, Gasser T, Harhangi 
B5, Meco G, Denefle P, Wood NW, ef al: Association between early-onset 
Parkinson's disease and mutations in the parkin gene. N Engl J Med 2000, 
342(21):1560-1567. 

88. Veeriah S, Taylor BS, Meng S, Fang F, Yilmaz E, Vivanco I, Janakiraman M, 
Schultz N, Hanrahan AJ, Pao W, ef al: Somatic mutations of the Parkinson's 
disease-associated gene PARK2 in glioblastoma and other human 
malignancies. Nat Genet 201 0, 42(1 ):77-82. 

89. Cesari R, Martin ES, Calin GA, Pentimalli F, Bichi R, McAdams H, Trapasso F, 
Drusco A, Shimizu M, Masciullo V, ef al: Parkin, a gene implicated in 
autosomal recessive juvenile parkinsonism, is a candidate tumor 
suppressor gene on chromosome 6q25-q27. Proc Natl Acad Sci USA 2003, 
100(10)5956-5961. 

90. Denison SR, Wang F, Becker NA, Schule B, Kock N, Phillips LA, Klein C, Smith 
Dl: Alterations in the common fragile site gene Parkin in ovarian and 
other cancers. Oncogene 2003, 22(51 ):8370-8378. 

91. Fujiwara M, Marusawa H, Wang HQ, Iwai A, Ikeuchi K, Imai Y, Kataoka A, 
Nukina N, Takahashi R, Chiba T: Parkin as a tumor suppressor gene for 
hepatocellular carcinoma. Oncogene 2008, 27(46):6002-601 1 . 

92. Poulogiannis G, Mclntyre RE, Dimitriadi M, Apps JR, Wilson CH, Ichimura K, 
Luo F, Cantley LC, Wyllie AH, Adams DJ, ef al: PARK2 deletions occur 
frequently in sporadic colorectal cancer and accelerate adenoma 
development in Ape mutant mice. Proc Natl Acad Scl USA 2010, 

1 07(34):1 51 45-1 51 50. 

93. Wang F, Denison S, Lai JP, Philips LA, Montoya D, Kock N, Schule B, Klein C, 
Shridhar V, Roberts LR, ef al: Parkin gene alterations in hepatocellular 
carcinoma. Genes Chromosomes Cancer 2004, 40(2):85-96. 

94. Picchio MC, Martin ES, Cesari R, Calin GA, Yendamuri S, Kuroki T, Pentimalli 
F, Sarti M, Yoder K, Kaiser LR, ef al: Alterations of the tumor suppressor 
gene Parkin in non-small cell lung cancer. Clin Cancer Res 2004, 
10(8)2720-2724. 

95. Kahkonen M: Population cytogenetics of folate-sensitive fragile sites. I. 
Common fragile sites. Hum Genet 1988, 80(4)344-348. 

96. Ren Y, Jiang H, Yang F, Nakaso K, Feng J: Parkin protects dopaminergic 
neurons against microtubule-depolymerizing toxins by attenuating 
microtubule-associated protein kinase activation. J Biol Chem 2009, 
284(6):4009-401 7. 

97. Wang H, Liu B, Zhang C, Peng G, Liu M, Li D, Gu F, Chen Q, Dong JT, Fu L, 
ef al: Parkin regulates paclitaxel sensitivity in breast cancer via a 
microtubule-dependent mechanism. J Pathol 2009, 218(1)76-85. 

98. Yang F, Jiang Q, Zhao J, Ren Y, Sutton MD, Feng J: Parkin stabilizes 
microtubules through strong binding mediated by three independent 
domains. J Biol Chem 2005, 280(1 7): 1 71 54-1 7 1 62. 

99. Manna T, Grenningloh G, Miller HP, Wilson L: Stathmin family protein 
SCG10 differentially regulates the plus and minus end dynamics of 
microtubules at steady state in vitro: implications for its role in neurite 
outgrowth. Biochemistry 2007, 46(1 1)3543-3552. 

100. Lee HS, Lee DC, Park MH, Yang SJ, Lee JJ, Kim DM, Jang Y, Lee JH, Choi JY, 
Kang YK, ef al: STMN2 is a novel target of beta-catenin/TCF-mediated 
transcription in human hepatoma cells. Biochem Biophys Res Commun 

2006, 345(3):1 059-1 067. 

101. Riederer BM, Pellier V, Antonsson B, Di Paolo G, Stimpson SA, Lutjens R, 
Catsicas S, Grenningloh G: Regulation of microtubule dynamics by the 
neuronal growth-associated protein SCG10. Proc Natl Acad Sci USA 1997, 
94(2)741-745. 

102. Ngo IT, Peng T, Liang XJ, Akeju O, Pastorino S, Zhang W, Kotliarov Y, 
Zenklusen JC, Fine HA, Marie D, ef a/: The Ip-encoded protein stathmin 
and resistance of malignant gliomas to nitrosoureas. J Natl Cancer Inst 

2007, 99(8):639-652. 

103. Liang XJ, Choi Y, Sackett DL, Park JK: Nitrosoureas inhibit the 
stathmin-mediated migration and invasion of malignant glioma cells. 
Cancer Res 2008, 68(13)5267-5272. 

104. Alii E, Bash-Babula J, Yang JM, Hait WN: Effect of stathmin on the 
sensitivity to antimicrotubule drugs in human breast cancer. Cancer Res 
2002, 62(23):6864-6869. 

105. Alii E, Yang JM, Hait WN: Silencing of stathmin induces tumor-suppressor 
function in breast cancer cell lines harboring mutant p53. 

Oncogene 2007, 26(7):1003-1012. 



Page 20 of 21 



106. Mistry SJ, Atweh GF: Therapeutic interactions between stathmin 
inhibition and chemotherapeutic agents in prostate cancer. Mol Cancer 
Ther 2006, 5(12)3248-3257. 

107. Muller FL, Colla S, Aquilanti E, Manzo VE, Genovese G, Lee J, Eisenson D, 
Narurkar R, Deng P, Nezi L, ef al: Passenger deletions generate therapeutic 
vulnerabilities in cancer. Nature 2012, 488(741 1)337-342. 

108. Nijhawan D, ZackTI, Ren Y, Strickland MR, Lamothe R, Schumacher SE, 
Tsherniak A, Besche HC, Rosenbluh J, Shehata S, ef al: Cancer vulnerabilities 
unveiled by genomic loss. Cell 2012, 150(4):842-854. 

109. Hanein S, Martin E, Boukhris A, Byrne P, Goizet C, Hamri A, Benomar A, 
Lossos A, Denora P, Fernandez J, ef al: Identification of the SPG15 gene, 
encoding spastizin, as a frequent cause of complicated 
autosomal-recessive spastic paraplegia, including Kjellin syndrome. 
Am J Hum Genet 2008, 82(4)392-1002. 

110. Sagona AP, Nezis IP, Bache KG, Haglund K Bakken AC, Skotheim Rl, Stenmark H: 
A tumor-associated mutation of FYVE-CENT prevents its interaction with 
Beclin 1 and interferes with cytokinesis. PLoS One 201 1, 6(3):e17086. 

111. Sagona AP, Nezis IP, Pedersen NM, Liestol K, Poulton J, Rusten TE, Skotheim 
Rl, Raiborg C, Stenmark H: Ptdlns(3)P controls cytokinesis through 
KIF13A-mediated recruitment of FYVE-CENT to the midbody. 

Nat Cell Biol 201 0, 1 2(4)362-371 . 

112. Pober BR: Williams-Beuren syndrome. N Engl J Med 2010, 362(3)239-252. 

113. Hobart HH, Morris CA, Mervis CB, Pani AM, Kistler DJ, Rios CM, Kimberley 
KW, Gregg RG, Bray-Ward P: Inversion of the Williams syndrome region is 
a common polymorphism found more frequently in parents of children 
with Williams syndrome. Am J Med Genet C Semin Med Genet 2010, 
154C(2):220-228. 

114. Osborne LR, Li M, Pober B, Chitayat D, Bodurtha J, Mandel A, Costa T, Grebe 
T, Cox S, Tsui LC, ef al: A 1.5 million-base pair inversion polymorphism in 
families with Williams-Beuren syndrome. Nat Genet 2001, 29(3)321-325. 

1 1 5. Tarn E, Young EJ, Morris CA, Marshall CR, Loo W, Scherer SW, Mervis CB, 
Osborne LR: The common inversion of the Williams-Beuren syndrome 
region at 7ql 1 .23 does not cause clinical symptoms. Am J Med Genet A 
2008, 146A(14):1797-1806. 

1 1 6. Aso T, Lane WS, Conaway JW, Conaway RC: Elongin (SIM): a 
multisubunit regulator of elongation by RNA polymerase II. Science 
1995, 269(5229):1439-1443. 

1 1 7. Conaway JW, Bradsher JN, Tan S, Conaway RC: Transcription factor SIN: 

a novel component of the RNA polymerase II elongation complex. Cell 
Mol Biol Res 1993, 39(4)323-329. 

118. Bradsher JN, Jackson KW, Conaway RC, Conaway JW: RNA polymerase II 
transcription factor Sill. I. Identification, purification, and properties. J Biol 
Chem 1993, 268(34)25587-25593. 

119. Kamura T, Burian D, Yan Q, Schmidt SL, Lane WS, Querido E, Branton PE, 
Shilatifard A, Conaway RC, Conaway JW: Mufl, a novel Elongin BC-interacting 
leucine-rich repeat protein that can assemble with Cul5 and Rbxl to 
reconstitute a ubiquitin ligase. J Biol Chem 2001, 276(32)29748-29753. 

120. Yasukawa T, Kamura T, Kitajima S, Conaway RC, Conaway JW, Aso T: 
Mammalian Elongin A complex mediates DNA-damage-induced 
ubiquitylation and degradation of Rpbl . EMBO J 2008, 27(24)3256-3266. 

121. Barr FG: Chromosomal translocations involving paired box transcription 
factors in human cancer. Int J Biochem Cell Biol 1997, 29(1 2):1 449-1 461 . 

122. Buckingham M, Relaix F: The role of Pax genes in the development of 
tissues and organs: Pax3 and Pax7 regulate muscle progenitor cell 
functions. Annu Rev Cell Dev Biol 2007, 23:645-673. 

123. Murdoch B, Delconte C, Garcia-Castro Ml: Pax7 lineage contributions to 
the Mammalian neural crest. PLoS One 2012, 7(7):e41089. 

124. Davis RJ, D'Cruz CM, Lovell MA, Biegel JA, Barr FG: Fusion of PAX7 to FKHR 
by the variant t(1;13)(p36;q14) translocation in alveolar 
rhabdomyosarcoma. Cancer Res 1 994, 54(1 1)2869-2872. 

125. Bennicelli JL, Advani S, Schafer BW, Barr FG: PAX3 and PAX7 exhibit 
conserved cis-acting transcription repression domains and utilize a 
common gain of function mechanism in alveolar rhabdomyosarcoma. 
Oncogene 1999, 18(30)4348-4356. 

126. Matsuda M, Kobayashi Y, Masuda S, Adachi M, Watanabe T, Yamashita JK, 
Nishi E, Tsukita S, Furuse M: Identification of adherens junction-associated 
GTPase activating proteins by the fluorescence localization-based 
expression cloning. Exp Cell Res 2008, 314(5):939-949. 

127. Parsons DW, Jones S, Zhang X, Lin JC, Leary RJ, Angenendt P, Mankoo P, 
Carter H, Siu IM, Gallia GL, ef al: An integrated genomic analysis of human 
glioblastoma multiforme. Science 2008, 321 (5897):! 807-181 2. 



Ray et al. BMC Genomics 201 3, 14:505 Page 21 of 21 

http://www.biomedcentral.com/1471-2164/14/505 



128. 



129. 



1 30. 



131. 



132. 



133. 



134. 



135. 



136. 



137. 



38. 



139. 



1 40. 



142. 



143. 



145. 



146. 



148. 



149. 



Parsons DW, Li M, Zhang X, Jones S, Leary RJ, Lin JC, Boca SM, Carter H, 
Samayoa J, Bettegowda C, et al: The genetic landscape of the childhood 
cancer medulloblastoma. Science 201 1, 331 (601 6):435-439. 
Grasso CS, Wu YM, Robinson DR, Cao X, Dhanasekaran SM, Khan AP, Quist MJ, 
Jing X, Lonigro RJ, Brenner JC, et al: The mutational landscape of lethal 
castration-resistant prostate cancer. Nature 2012, 487(7406)239-243. 
Jones S, Zhang X, Parsons DW, Lin JC, Leary RJ, Angenendt P, Mankoo P, 
Carter H, Kamiyama H, Jimeno A, et al: Core signaling pathways in human 
pancreatic cancers revealed by global genomic analyses. Science 2008, 
321(5897):1801-1806. 

Berger MF, Hodis E, Heffernan TP, Deribe YL, Lawrence MS, Protopopov A, 
Ivanova E, Watson IR, Nickerson E, Ghosh P, et al: Melanoma genome 
sequencing reveals frequent PREX2 mutations. Nature 2012, 
485(7399):502-506. 

Bernstein BE, Birney E, Dunham I, Green ED, Gunter C, Snyder M: An 
integrated encyclopedia of DNA elements in the human genome. 

Nature 2012, 489(7414)57-74. 

Ernst J, Kheradpour P, Mikkelsen TS, Shoresh N, Ward LD, Epstein CB, Zhang X, 
Wang L, Issner R, Coyne M, et al: Mapping and analysis of chromatin state 
dynamics in nine human cell types. Nature 201 1, 473(7345)43-49. 
Kalyana-Sundaram S, Kumar-Sinha C, Shankar S, Robinson DR, Wu YM, Cao 
X, Asangani IA, Kothari V, Prensner JR, Lonigro RJ, et al: Expressed 
pseudogenes in the transcriptional landscape of human cancers. 
Ce//2012, 149(7):1622-1634. 

Rinn JL, Kertesz M, Wang JK, Squazzo SL, Xu X, Brugmann SA, Goodnough 
LH, Helms JA, Farnham PJ, Segal E, et at Functional demarcation of active 
and silent chromatin domains in human HOX loci by noncoding RNAs. 

Ce//2007, 129(7):131 1-1323. 

Forbes SA, Bhamra G, Bamford S, Dawson E, Kok C, Clements J, Menzies A, 

Teague JW, Futreal PA, Stratton MR: The Catalogue of Somatic Mutations 

in Cancer (COSMIC). Curr Protoc Hum Genet 2008, 10:10-11. 

Forbes SA, Blndal N, Bamford S, Cole C, Kok CY, Beare D, Jia M, Shepherd R, 

Leung K, Menzies A, et al: COSMIC: mining complete cancer genomes in 

the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res 201 1, 

39:945-950. 

Sun L, Hui AM, Su Q, Vortmeyer A, Kotliarov Y, Pastorino S, Passaniti A, Menon 
J, Walling J, Bailey R, etal: Neuronal and glioma-derived stem cell factor 
induces angiogenesis within the brain. Cancer Cell 2006, 9(4)287-300. 
Edgar R, Domrachev M, Lash AE: Gene expression omnibus: NCBI gene 
expression and hybridization array data repository. Nucleic Acids Res 2002, 
30(1)207-210. 

Yuan M, Kendziorski C: A unified approach for simultaneous gene clustering 
and differential expression identification. Biometrics 2006, 62(4):1 089-1 098. 
Bauer S, Grossmann S, Vingron M, Robinson PN: Ontologizer 2.0-a 
multifunctional tool for GO term enrichment analysis and data 
exploration. Bioinformatics 2008, 24(1 4):1 650-1 65 1 . 
Zhang B, Kirov S, Snoddy J: WebGestalt: an integrated system for 
exploring gene sets in various biological contexts. Nucleic Acids Res 2005, 
33:W741-W748. 

Wuchty S, Arjona D, Bozdag S, Bauer PO: Involvement of microRNA 
families in cancer. Nucleic Acids Res 2012, 40(1 7):821 9-8226. 
Coons SW, Johnson PC, Schelthauer BW, Yates AJ, Pearl DK: Improving 
diagnostic accuracy and interobserver concordance in the classification 
and grading of primary gliomas. Cancer 1997, 79(7)4 381-1393. 
Reis-Filho JS, Faoro LN, Carrilho C, Bleggi-Torres LF, Schmitt FC: Evaluation 
of cell proliferation, epidermal growth factor receptor, and bcl-2 
immunoexpression as prognostic factors for patients with World Health 
Organization grade 2 oligodendroglioma. Cancer 2000, 88(4):862-869. 
VanDevanter DR, Yirdaw G, Do C, Tyssellng KA, Drescher CA, Forseth BJ, 
Von Hoff DD, McNutt MA: Preparation of uniform, intact DNA samples 
from resected tumor tissues for pulsed-field gel electrophoretic analyses. 
Biotechniques 1992, 13(6):884-887. 

Lim A, Dimalanta ET, Potamousls KD, Yen G, Apodoca J, Tao C, Lin J, Oi R, 
Skiadas J, Ramanathan A, et al: Shotgun optical maps of the whole 
Escherichia coli 0157:H7 genome. Genome Res 2001, 11(9)4 584-1593. 
Lim A: Single molecule systems: Advancements and applications to microbial and 
human genome analysis. PhD Thesis. University of Wisconsin-Madison; 2004. 
Reslewic S: The Optical Mapping of genomes: Gaining new insights on 
genomic structure and variation by single DNA molecule analysis. PhD Thesis. 
University of Wisconsin-Madison; 2005. 



150. 



151 



152. 



153. 



154. 



155. 



156. 



157. 



Zhou S, Herschleb J, Schwartz DC: Optical Mapping: a single molecule 
system for genome analysis. In New methods for DNA sequencing. 
2007:265-300. 

Schwartz DC, Cantor CR: Separation of yeast chromosome-sized DNAs by 
pulsed field gradient gel electrophoresis. Cell 1984, 37(l):67-75. 
Zhou S, Deng W, Anantharaman TS, Lim A, Dimalanta ET, Wang J, Wu T, 
Chunhong T, Creighton R, Kile A, et at A whole-genome shotgun optical 
map of Yersinia pestis strain KIM. Appl Environ Microbiol 2002, 
68(12):6321-6331. 

Anantharaman T, Mishra B, Schwartz D: Genomics via Optical Mapping: III: 
Contiging genomic DNA. Proc Int Conf Intell Syst Mol Biol 1999, 1999:18-27. 
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: 
The human genome browser at UCSC. Genome Res 2002, 12(6):996-1006. 
Lai W, Choudhary V, Park PJ: CGHweb: a tool for comparing DNA copy 
number segmentations from multiple algorithms. Bioinformatics 2008, 
24(7)4014-1015. 

lafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, 
Lee C: Detection of large-scale variation in the human genome. 

Nat Genet 2004, 36(9):949-951 . 

Untergasser A, Nijveen H, Rao X, Bisseling T, Geurts R, Leunissen JA: 
Primer3Plus, an enhanced web interface to Primer3. Nucleic Acids Res 
2007, 35:W71-W74. 



doklO.1 186/1471-2164-14-505 

Cite this article as: Ray et al: Discovery of structural alterations in solid 
tumor oligodendroglioma by single molecule analysis. BMC Genomics 

2013 14:505. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www. biomedcentra I .com/su bmit 



o 



BioMed Central 



